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POTASSIUM CHANNELS, NUCLEOTIDE SEQUENCES ENCODING 
THEM, AND METHODS OF USING SAME 

32,42 1-C2 

The application is a continuation-in-part of co-pending 
PCT/US95/14364 filed on October 25, 1995 which is a continuation-in-part of U.S. 
Serial No. 332,312 filed onOctober31, 1994, now U.S. Patent No. 5,559,026, issued 
September 24, 1996. 

Field of Invention 
This invention relates generally to a new family of potassium channels. 

More particularly, the present invention relates to the cloning and characterization of 

a family of distinct trans-membrane potassium ion channels, characterization of such 

channels, newly identified polynucleotide sequences, polypeptides encoded by such 

sequences, expression vectors capable of heterologous expression of such 

polynucleotide sequences, transformed host cells containing the expression vectors 

and assay methods for determining the expression of heterologous nucleotide 

sequences encoding all or a portion of said potassium channels in host cells, 

chromosome mapping, diagnostic methodologies and kits therefor. Genes encoding 

potassium channels representative of this family were cloned from Drosophila 

melanogaster, Caenorhabditis elegans, human and mouse ESTs, and human brain, 

heart, and kidney cDNA libraries. More particularly, the invention arises in part from 

the determination that the DNA sequences of these genes encode a structurally distinct 
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potassium channel whose molecular architecture is characterized by four membrane 
spanning domains and two putative pore forming domains. 

Background of the Invention 

Ion channels, which include sodium (Na + ), potassium (K + ), and 
calcium (Ca^), are present in both eukaryotic and prokaryotic cells and control a 
variety of physiological and pharmacological processes. Potassium channels 
comprise a large and diverse group of integral membrane proteins that are involved in 
the movement of potassium into and out of the cell. Such channels regulate the level 
of excitability and repolarization properties of neurons and muscle fibers [B. Hille, 
Ionic Channels of Excitable Membranes, 2d Ed., Sinauer, Sunderland, MA (1992)] 
and are implicated in a broad spectrum of processes in both excitable and non-excit- 
able cells. In almost all cells, K + channels play a role in determining the resting elec- 
trical membrane potential by setting the membrane permeability to K + ions. Potassi- 
um currents have been shown to be more diverse than sodium or calcium currents and 
play a role in determining the way a cell responds to external stimuli. 

Several classes of K + channels have been identified based on their 
pharmacological and electrophysiological properties; these include voltage-gated, 
ATP-sensitive, muscarinic-activated, S type, SK Ca^-activated, Na + -activated, and 
inward and/or outward rectifier types of K + channels. Prior to this work, and on the 
basis of membrane-spanning segments, potassium channels may be subdivided into 
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topologically distinct classes. For example, one well-known class of voltage-gated, 
calcium activated, and/or cyclic nucleotide-gated-channels is composed of six 
membrane scanning domains (S1-S6) one of which contains repeated positive charges 
presumed to be involved in the voltage sensing of these channels and hence in their 
functional outward rectification and a single pore forming domain (H5 or P region). 
A second class may be described as an inward rectifying potassium channel that 
passes through the cellular membrane twice and also contains a single pore forming 
region [Y. Kubo, E. Reuveny, P. A. Slesinger, Y.N. Jan, L.Y. Jan, Nature 364, 802- 
806 (1993); Y. Kubo, TJ. Baldwin, Y.N. Jan, L.Y. Jan, Nature 362, 127-133 (1993); 
see also American Cyanamid copending U.S. patent application # 08/431,928 filed on 
6/28/1995 for a description of "HIRK"]. 

The best characterized class of K + channels are the voltage-gated 
outward rectifying channels (the K v family), the prototype being the protein which is 
coded for by the Shaker gene seen in Drosophila rnelanogaster, which is a voltage- 
gated channel. The proteins in this gene family contain a structural motif 
characterized by six membrane spanning segments (S1-S6), a putative voltage sensor 
(S4), and an S5-S6 linker (H5 or P region) involved in ion conductance. A functional 
channel is assembled in the membrane via the association of four Shaker subunits, 
necessitating the presence of four P domains. 

Another well characterized class of potassium channel proteins, the 
inward rectifier potassium channels (K ir family) play a significant role in maintaining 
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the resting potential of, and in controlling the excitability of a cell. These channels 
are characterized by two transmembrane domains and a pore-forming region and the 
lack of an S4 or voltage sensing region. Inward rectifying K + channels are generally 
characterized by two transmembrane domains and one pore-forming domain. The 
pore-forming domain is common to both groups of K + channels, the voltage-gated 
outward rectifier groups and the inward rectifying K + channels and is an essential 
element of the aqueous K + -selective pore. A functional channel is assembled in the 
membrane via the association of four K ir subunits, necessitating the presence of four P 
domains. 

A potassium channel from Saccharomyces cerevisiae designated Tokl, 
[Ketchum et al, Nature 376, 690-695 (1995)] or YORK [Lesage et al, J. Biol. Chem 
271 , 4183-4187 (1996)] has recently been identified and is characterized by the 
presence of two pore (2P) domains and an outward rectifying K + -selective current 
which is coupled to potassium equilibrium [Ketchum et ah. Nature 376 , 690-695 
(1995)]. In contrast to the other channels described, the yeast channel comprises eight 
transmembrane domains, such domains resembling an assembly of an inward 
rectifying K + channel of the K ir family (two transmembrane domains) with an outward 
rectifying channel of the K v family (six transmembrane domains). 

A channel with four transmembrane domains and two pore-forming 
regions has recently been described by the present inventors [Goldstein, S. et al, Proc. 
Natl. Acad. Sci. USA 93 13256-13261 (1996) - "DmORFl" (also referred to as ORK1 
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or DORK)]. Other Investigators have described additional members of this potassium 
channel family [Fink, M. et al, EMBO J. 15, 6854-6862 (1996) - "TREK"; Lesage et 
aL, EMBO Journal, 15, 1004-101 1 (1996) - "TWIK-1"; Lesage F. et aL, FEBS Lett. 
402 . 28-32 (1997)]. It has also been postulated that eight potassium channel families 
have been revealed by the C elegans genome project, Wei A., et al. 9 
Neuropharmacology 35, No. 7, 805-829 (1996). 

Summary of the Invention 
A first aspect of the present invention is the discovery of a new family 
of potassium channel genes and proteins encoded thereby. Potassium channels 
belonging to this new family comprise four hydrophobic domains capable of forming 
transmembrane helices, wherein a first pore-forming domain is interposed between the 
first and second transmembrane helices and a second pore-forming domain is 
interposed between the third and fourth transmembrane helices, and the channels 
further contain various potassium selective peptide motifs. In preferred embodiments, 
the channels contain a GXG motif in the first pore-forming region and preferably in 
both pore-forming regions, wherein X is an amino acid selected from the group 
consisting of Y, F, V, I, M, and L, and particularly L or I. The channels preferably 
contain a further peptide motif in the Pj and/or P 2 pore-forming regions, spanning 
several amino acids upstream of GXG, and particuarly for about six (6) amino acids 
upstream of the first G. Thus, the preferred pore-forming region motif is 
ZXXZ X Z 2 Z^GXG where Z, Z t and Z 2 are preferably the amino acids residues T or S 
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and Z 3 is preferably I or V, and X is as described above, again, with the amino acid 
residues L or I particularly preferred. 

In further preferred embodiments, the channels display yet a second 
peptide motif, Z 4 X 1 X2X3GX 4 PX5, wherein Z 4 is the amino acid residue Y or F and 
preferably Y, and X 1? X 2? X 3 , and X 4 are amino acid residues, wherein X x residues are 
A, S ? or G, with A or S preferred; and X 2 through X 5 are the amino acid residues M, I, 
V, L, F, or Y, with L or I particularly preferred. In certain embodiments, this motif is 
"YALLGIP." This second peptide motif is located downstream of P x generally about 
12-25 amino acids downstream, and preferably about 16 amino acids downstream of 

Pi- 

In certain preferred embodiments, the isolation and characterization of 
invertebrate (i.e. insect and nematode) potassium channel genes belonging to this new 
family is presented. In more preferred embodiments, the present invention further 
provides the isolation and characterization of polynucleotides from invertebrates and 
vertebrates, which encode amino acid sequence elements unique to this potassium 
gene family and specifically sourced from Drosophila melanogaster, Caenorhabditis 
elegans, avian libraries, murine and various other mammalian libraries, and libraries 
from all human tissues including human heart and brain. 

A third aspect of the present invention is a method of controlling 
nematode and insect pests by inhibiting or activating potassium channels substantially 
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homologous to those encoded by nucleotide sequences as presented herein. Another 
aspect of the present invention is to influence and alleviate human disease states 
modulating membrane potential with therapeutic agents that interact with the 
potassium channels biologically equivalent to those encoded by nucleotide sequences 
as encoded herein. 

Various screening assay embodiments are also presented herein as well 
as chromosome identification and mapping techniques, diagnostic methodologies and 
kits therefore, and transgenic animals. 

Brief Description of the Drawings 

FIGURE 1. Growth of CY162 cells bearing pDmORFl. CY162 cells transformed 
with plasmids isolated from survivors of a primary library screen for plasmids that 
support the growth of CY162 on medium contain low potassium concentration. Six 
individual transformants of each plasmid-bearing strain are cultured in patches on the 
indicated medium. CY162 cells bearing pDmORFl are found in the upper left-hand 
corner of each plate while pKATl containing cells are found in the lower right hand 
corner. 

FIGURE 2 A and 2B. DNA sequence and deduced amino acid sequence of Dm OKF1 
[SEQ ID NOS:l and 2]. The nucleotide sequence of the 2.4 kb cDNA revealed a 
single long open reading frame proximal to the GAL1 promoter. Segments 
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corresponding to putative transmembrane (M1-M4) and pore-forming H5 domains in 
the predicted polypeptide are underlined. The single amino-terminal asparagine 
linked glycosylation site is indicated by a G. 

FIGURE 3 A and 3B. DNA sequence and deduced amino acid sequence of the 
F22b7.7 segment of the Caenorhabditis elegans genome [SEQ ID NO:3]. Segments 
corresponding to putative transmembrane (M1-M4) and pore-forming H5 domains in 
the predicted polypeptide are underlined. 

FIGURE 4. Alignment of DmORFl and F22b7.7 sequences. Protein-coding regions 
of DmORFl [SEQ ID NO: 37] and F22b7.7 [SEQ ID NO: 38] (designated as CeORF- 
1 in this FIGURE) are compared using the protein sequence alignment algorithm in 
Genework DNA sequence analysis software. Identical amino acids are boxed. 

FIGURE 5 A. Comparison of the pore-forming domains of DmORFl and F22b7.7. 
Amino acid sequences from the six cloned Drosophila melanogaster potassium 
channels and three inward rectifier channels [SEQ ID NOS:7 through 21] are 
compared to DmORFl and F22b7.7 within the pore-forming H5 regions. Amino acid 
identities are indicated by a vertical line and conserved substitutions indicated by a 
dot. Amino acid substitutions deemed acceptable are indicated. 
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FIGURE 5B. Hydropathy plot analysis of the DmORFl and F22b7.7 polypeptide 
sequence. The Kyte-Doolittle hydropathy algorithm in the Geneworks DNA analysis 
software is used to predict the topology of DmORFl and F22b7.7. The position of 
predicted membrane spanning domains (M1-M4) and pore-forming domains are 
indicated. 

FIGURE 6. Predicted membrane spanning topology of DmORFl. 

FIGURE 7. Heterologous potassium channel-dependent growth of plasmid bearing 
CY162 (trkA) strains. CY162 bearing pYES2, pKATl, pDmORFl, and pRATRAK 
are cultured at 30°C for four days on arginine phosphate agar medium containing 0 
mM, 0.2 mM, or 100 mM added KC1. 

FIGURE 8. Inhibition of growth of yeast cells containing heterologous potassium 
channels. CY162 cells (10 5 ) bearing the indicated plasmids are plated in arginine 
phosphate agar medium containing 0.2 mM potassium chloride. Sterile filter disks 
were placed on the surface of the agar and saturated with 20 \il of a 1 M solution of 
potassium channel blocking compound. Clockwise from upper left-hand corner is 
BaCl 2 , CsCl, TEA, and RbCl. KC1 is applied to the center disk. 
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FIGURE 9 A and 9B. DNA sequence and deduced amino acid sequence of CORK 
[SEQ ID NO: 36]. The nucleotide sequence of the 1 .4 kb cDNA revealed a single 
long open reading frame proximal to the GAL1 promoter. Segments corresponding to 
pore-forming H5 domains in the predicted polypeptide are underlined. Asparagine- 
linked glycosylation sites are indicated by a G. 

Figure 10. Depicts a schematic representation of a preferred motif of the potassium 
channels of the invention. 

Detailed Description of the Invention 
Nucleotide bases are abbreviated herein as follows: 
Ade; A- Adenine G-Guanine Ura; U-Uracil 

C-Cytosine; T-Thymine; Ino; I or N (Inosine ~ bonds to any of the others) 

Amino acid residues are abbreviated herein to either three letters or a single letter as 
follows: 

Ala;A- Alanine Leu;L-Leucine 
Arg;R-Arginine Lys;K-Lysine 
Asn;N-Asparagine Met;M-Methionine 
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Asp;D-Aspartic acid Phe;F-Phenylalanine 
Cys;C-Cysteine Pro;P-Proline 
Gln;Q-Glutamine Ser;S-Serine 
Glu;E-Glutamic acid Thr;T-Threonine 
Gly;G-Glycine Trp;W-Tryptophan 
His;H-Histidine Tyr;Y-Tyrosine 
Ile;I-Isoleucine Val;V- Valine 

The term "mammalian" as used herein refers to any mammalian species 
(e.g., human, mouse, rat, and monkey). 

The term "heterologous" as used herein refers to nucleotide sequences, 
proteins, and other materials originating from organisms other than the host organism 
used in the expression of the potassium channels or portions thereof, or described herein 
(e.g., mammalian, avian, amphibian, insect, plant), or combinations thereof not naturally 
found in the host organism. 

The terms "upstream" and "downstream" are used herein to refer to the 
direction of transcription and translation, with a sequence being transcribed or translated 
prior to another sequence being referred to as "upstream" of the latter. 
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The term "channel" and the nucleotide sequences encoding same, is 
intended to encompass all potassium channels, and mutants, derivatives, homologs, and 
other variations thereof. 

The term "EST" as used herein refers to an expressed sequence tag. 

Here we report the cloning and functional expression of a novel family 
of potassium channels exhibiting a unique topological configuration, and 
demonstrating particular physiological characteristics. Potassium channels belonging 
to this family may be derived from a wide variety of animal species, both vertebrate 
and invertebrate. This family is structurally and functionally novel, as manifested by 
the presence of two-pore forming domains (2P) in conjunction with a four membrane 
spanning domain configuration. Nucleotide sequences encoding various 
representative members of this new family of two-pore K + channels were cloned by 
expression in yeast cells from Drosophila melanogaster (dORK or DmORF), and 
also by degenerate PCR from human brain, heart, and kidney cDNA (hORKl), and 
from human and mouse ESTs. Preliminary analyses of expression by a northern 
blotting procedure indicates that hORKl is present primarily in human brain. Genes 
encoding structural homologues are present in the genome of Drosophila 
melanogaster (dORK), Caenorhabditis elegans (cORK), avian tissue and various 
mammalian tissue such as human (hORKl) and murine. 
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The potassium channel family of the present invention may be structurally 
characterized in that the potassium channels have four hydrophobic domains capable of 
forming transmembrane helices. These channels are further characterized in that they 
comprise two pore-forming domains, one of which is interposed between said first helix 
and said second helix, and the other of which is interposed between said third helix and 
said fourth helix. While the present inventors do not wish to be bound by theory, it is 
hypothesized that the 2P channels organize as dimers in the plasma membrane, consistent 
with a requirement for four (4P) domains to form a functional channel. The pore-forming 
domains further contain a potassium selective motif which serves to confer upon the 
channel the ability to pass potassium ions to the exclusion of other ions, such as sodium, 
calcium, and the like. In certain preferred embodiments, this motif contains the peptide 
Y/G, and particularly in either a dipeptide or tripeptide motif, and frequently with Y/F-G 
bonding. In more preferred embodiments, the motif comprises GXG, wherein X is an 
amino acid selected from the group consisting of V, L, Y, F, M, and I, and preferably L or 
I, such motif generally being found between the first two transmembrane domains. In 
certain other motif configurations, a second GXG motif, wherein X is an amino acid 
selected from the aforementioned group, is found between the third and fourth 
transmembrane domain as well. The channels preferably contain a further peptide motif 
in the P x and/or P 2 pore-forming regions, spanning several amino acids upstream of GXG, 
and particuarly for about six (6) amino acids upstream of the first G. Thus, the preferred 
pore-forming region motif is ZXXZ X Z 2 Z 3 GXG where Z, Z x and Z 2 are preferably the 
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amino acids residues T or S and Z 3 is preferably I or V, and X is as described above, 
again, with the amino acid residues L or I particularly preferred. 

In yet further embodiments, the potassium channels of the invention 
comprise a second peptide motif, which in terms of the DNA encoding it, is located 
downstream of the first GXG motif, and within the second transmembrane domain (see 
Figure 13 for a schematic depiction). This is the Z 4 X X X2X 3 GX 4 PX 5 motif wherein Z 4 is 
the amino acid residue Y or F and preferably Y, and X is an amino acid residue wherein 
Xj is A, S, or G with A or S preferred, and X 2 through X 5 are the amino acid residues M, 
I, V, L, F, or Y, with L or I particularly preferred. In other embodiments, the preferred 
Z4XJX2X3GX4PX5 motif is flanked by the first GXG motif (that is located between the 
first and second transmembrane domain) and is located in the second transmembrane, and 
a second pore-forming peptide motif is located downstream of the first pore-forming 
motif, between the third and fourth transmembrane domains. In preferred embodiments, 
the preferred Z 4 X X X2X3GX4PX5 motif is located downstream of the first pore-forming 
peptide motif by about 12-25 amino acids. In other preferred embodiments the first pore- 
forming peptide motif is within about 16 amino acids. In general, the topological 
configuration of the potassium channels of the invention is such that one may presume 
that a regulatory domain of indeterminate length often may be interposed between the 
second transmembrane domain (TM2) and the third transmembrane domain (TM3). Thus, 
the size and characteristics of this domain may vary with cell type and needs, and is 
thereby a structure that is conducive to the conveyance of biological flexibility to the 
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requirements and function of a particular cell. In certain embodiments, Z 4 X 1 X2X3GX 4 PX 5 
comprise the amino acids YALLGX 4 P, and particularly "YALLGIP." 

In other embodiments, the potassium channels of the present invention 
further comprise a glycosylation site. This site may be an amino-terminal glycosylation 
site and may also be asparagine-linked. 

The potassium channels of the present invention possess certain properties 
in common with known potassium channels including, voltage-gated channels, calcium 
activated channels, cyclic nucleotide gated channels, inward rectifier channels, and the 
like, and especially with regard to electrophysiological properties. However, a hallmark 
of the potassium channels of the invention are that they exhibit either outward current 
rectification or both inward and outward current rectification, in each case affected by 
potassium concentration. 

Potassium channels play an essential role in determining the resting 
electrical membrane potential by setting the membrane permeability to K + ions. The 
cloned 2P channels confer potassium selective currents when expressed in Xenopus 
oocytes. The dORK channels encode instantaneous open-pore channel activity. Thus, the 
potassium ions flow either into or out of the cell, depending on the magnitude and 
direction of the electrochemical driving force. In contrast, the human 2P channel 
designated herein as hORKl, is functionally distinguishable from dORK in that the 
hORKl channel permits potassium flow primarily in an outward direction. Even when 
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external potassium concentration is raised to the point where the electrochemical potential 
will drive potassium flux into oocytes containing dORK, little inward potassium current is 
observed in hORKl -containing oocytes. 

It will be understood by those skilled in the art that the invention is not 
limited to the specific nucleotide and amino acid sequences depicted in the Sequence 
Listing, but also includes sequences that hybridize to such depicted sequences. Further, 
the invention also encompasses modifications to the depicted sequences, such as 
deletions, insertions, or substitutions in the sequence which produce changes in the 
resulting protein molecule that are not detrimental to the protein's activity. For example, 
alterations in the gene sequence which reflect the degeneracy of the genetic code, or 
which result in the production of a biologically equivalent amino acid at a given site, are 
contemplated; thus, a codon for the amino acid alanine, a hydrophobic amino acid, may be 
substituted by a codon encoding another less hydrophobic residue such as glycine, or a 
more hydrophobic residue, such as valine, leucine, or isoleucine. Similarly, changes 
which result in substitution of one negatively charged residue for another, such as aspartic 
acid for glutamic acid, or one positively charged residue for another, such as lysine for 
arginine, can also be expected to produce a biologically equivalent product. One skilled 
in the art will understand that assembly of 2P channel into functional dimers may require 
disulfide formation, and should take that into consideration when making modifications as 
taught herein [see e.g., Lesage et al 9 EMBO J. 15, 6400-6407 (1996)]. In some cases, it 
may in fact be desirable to make mutants of the sequence in order to study the effect of 
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alteration on the biological activity of the protein. Each of the proposed modifications is 
well within the routine skill in the art, as is determination of the retention of biological 
activity of the encoded products. 

The present invention further provides functional derivatives of the 
nucleotide sequences encoding the potassium channels of the invention. As used herein, 
the term "functional derivative" is used to define any DNA sequence which is derived 
from the original DNA sequence and which still possesses at least one of the biological 
activities present in the parent molecule. A functional derivative can be an insertion, 
deletion, or a substitution of one or more bases in the original DNA sequence. 

Functional derivatives of the nucleotide sequences as presented herein, 
having an altered nucleic acid sequence can be prepared by mutagenesis of the DNA. 
This can be accomplished using one of the mutagenesis procedures known in the art. For 
example, preparation of functional derivatives may be achieved by site-directed 
mutagenesis. Site-directed mutagenesis allows the production of functional derivatives 
through the use of a specific oligonucleotide which contains the desired mutated DNA 
sequence. Site-directed mutagenesis typically employs a phage vector that exists in both a 
single-stranded and double-stranded form. Typical vectors useful in site-directed 
mutagenesis include vectors such as the M 13 phage, as disclosed by Messing et aL, Third 
Cleveland Symposium on Macromolecules and Recombinant DNA, Editor A. Walton, 
Elsevier, Amsterdam (1981), the disclosure of which is incorporated herein by reference. 
These phage are commercially available and their use is generally well known to those 
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skilled in the art. Alternatively, plasmid vectors containing a single-stranded phage origin 
of replication [Veira et ah, Meth. Enzymol. 153 :3 (1987)] may be employed to obtain 
single-stranded DNA. 

While the site for introducing a sequence variation is predetermined, the 
mutation per se need not be predetermined. For example, to optimize the performance of 
a mutation at a given site, random mutagenesis may be conducted at a target region and 
the newly generated sequences can be screened for the optimal combination of desired 
activity. 

Biologically equivalent refers to those modified nucleic acid and amino 
acid sequences in which the modified sequence at least substantially maintains the 
biological activity of the unmodified sequence; i.e., in the case of a nucleic acid sequence, 
the protein expressed therefrom at least substantially maintains the biological activity. 
Thus, the present invention also relates to the biologically equivalents of the potassium 
channel proteins whether specifically modified as described above or other isolated 
proteins. Biologically equivalent as used herein means protein having some homology 
with the hORK protein, wherein such protein maintains all or substantially all of the 
biological activity of the hORK protein, and contain the pore-forming peptide motif and 
preferably also the Z 4 X 1 X 2 X 3 GX4PX5 motif. The percentage of homology can vary from 
at least about 20% up to about 99.95%. Certainly percentage homologies of at least about 
40%, at least about 70%, at least about 90% or at least about 95% can be employed based 
on the retention of biological activity. One skilled in this art will note that forty percent 
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(40%) homology at amino acid level is usually consistent with retention of comparable 2° 
and 3° structure amongst homologs. 

It is difficult to predict the exact effect of the substitution, deletion, 
insertion, or other modification in advance of making same, or to determine a suspected 
biological equivalent or functional derivative. However, one skilled in the art will 
recognize that the functionality of the modified construct or the suspected biological 
equivalent or functional derivative can be evaluated by routine screening assays. As one 
example, mRNA encoded by a functional derivative made by site-directed mutagenesis 
can be injected into an oocyte as described in the EXAMPLES and the oocyte tested for 
channel activity. Other target constructs may also be tested in this manner. 

Any eukaryotic organism can be used as a source for a protein which is a 
member of the potassium channel family as described herein, or the genes encoding same, 
so long as the source organism naturally expresses such a protein or contains genes 
encoding same. As used herein, "source organism" refers to the original organism from 
which the amino acid or DNA sequence of the protein is derived, regardless of the 
organism the protein is expressed in and ultimately isolated from. For example, a member 
of the hORK family of channel proteins expressed in hamster cells, yeast cells, or the like, 
is of human origin as long as the amino acid sequence is that of a human protein which is 
a member of this family. 
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A variety of methodologies known in the art can be utilized to obtain a 
member of this family of channel proteins. In one method, the protein is purified from 
tissues or cells which naturally produce the protein. One skilled in the art can readily 
follow known methods for isolating proteins in order to obtain a member of the protein 
family, free of natural contaminants. These include, but are not limited to, 
immunochromatography, HPLC, size-exclusion chromatography, ion-exchange 
chromatography, and immunoaffinity chromatography. 

The invention provides further methods of obtaining other members of this 
novel family of potassium channels, i.e., those sharing significant homology to one or 
more regions of the proteins described herein. Specifically, by using the sequences 
disclosed herein as probes or as primers, and techniques such as PCR cloning and 
colony/plaque hybridization, one skilled in the art can obtain other members of the family 
of potassium channel proteins as well as genomic sequences encoding such additional 
family members. 

Region specific primers or probes derived from any of the sequences in the 
Sequence Listing can be used to prime DNA synthesis and PCR amplification, as well as 
to identify colonies containing cloned DNA encoding a member of this family using 
known methods. 

When using primers derived from one of the nucleotide sequences for 
amplification, one skilled in the art will recognize that by employing high stringency 
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conditions, annealing at 50°-60° C, sequences which are greater than 75% homologous to 
the primer will be amplified. By employing lower stringency conditions, annealing at 
35°-37° C, sequences which are greater than 40-50% homologous to the primer will be 
amplified. 

When using DNA probes derived from one of the nucleotide sequences for 
colony/plaque hybridization, one skilled in the art will recognize that by employing high 
stringency condition, hybridization at 50°-65° C, 5X SSPC, 0-50% formamide, wash at 
50°-65° C, 0.5X SSPC, sequences having regions which are greater than 90% 
homologous to the probe can be obtained, and by employing lower stringency conditions, 
hybridization at 35°-37° C, 5X SSPC, 40-45% formamide, wash at 42° C, SSPC, 
sequences having regions which are greater than 35-45% homologous to the probe will be 
obtained. 

Any tissue can be used as the source for the genomic DNA or RNA 
encoding members of the hORK family of potassium channels. However, with respect to 
RNA, the most preferred source is tissues which express elevated levels of the desired 
potassium channel family member. However, using the sequences as taught herein, it is 
now possible to identify such cells using the dORK, cORK or hORK sequence as a probe 
in northern blot or in situ hybridization procedures, thus eliminating the necessity to 
obtain RNA/DNA from a tissue which expresses elevated levels of such protein. 
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Genes encoding the potassium channels of the present invention may be 
expressed in a recombinant host. Heterologous DNA sequences are typically expressed in 
a host by means of an expression vector. An expression vector is a replicable DNA 
construct in which a DNA sequence encoding the heterologous DNA sequence is operably 
linked to suitable control sequences capable of affecting the expression of a protein or 
protein subunit coded for by the heterologous DNA sequence in the intended host. 
Generally, control sequences include a transcriptional promoter, an optional operator 
sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding 
sites, and (optionally) sequences which control the termination of transcription and 
translation. Vectors useful for practicing the present invention include plasmids, viruses 
(including bacteriophage), and integratable DNA fragments (i.e., fragments integratable 
into the host genome by genetic recombination). The vector may replicate and function 
independently of the host genome, as in the case of a plasmid, or may integrate into the 
genome itself, as in the case of an integratable DNA fragment. Suitable vectors will 
contain replicon and control sequences which are derived from species compatible with 
the intended expression host. For example, a promoter operable in a host cell is one 
which binds the RNA polymerase of that cell, and a ribosomal binding site operable in a 
host cell is one which binds the endogenous ribosomes of that cell. 

DNA regions are "operably associated" when they are functionally related 
to each other. For example, a promoter is operably linked to a coding sequence if it 
controls the transcription of the sequence; a ribosome binding site is operably linked to a 
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coding sequence if it is positioned so as to permit translation. Generally, operably linked 
means contiguous and, in the case of leader sequences, contiguous and in reading phase. 

Transformed host cells of the present invention are cells which have been 
transformed or transfected with the vectors constructed using recombinant DNA 
techniques and express the protein or protein subunit coded for by the heterologous DNA 
sequences. The novel nucleic acid sequences of the invention and fragments thereof can 
be used to express protein in a variety of host cells, both prokaryotic and eukaryotic. 
Examples of suitable eukaryotic cells include mammalian cells, plant cells, yeast cells, 
and insect cells. Suitable prokaryotic hosts include Escherichia coli and Bacillus subtilis. 
Illustrative of conventional mammalian host cells are Chinese hamster ovary (CHO) cells, 
COS cells, human embryonic kidney cells, NIH3T3 fibroblasts and mouse Ltk cells. 
Illustrative of insect cells are SP9 cells. 

Suitable expression vectors are selected based upon the choice of host cell. 
Numerous vectors suitable for use in transforming host cells are well known. For 
example, plasmids and bacteriophages, such as A phase, are the most commonly used 
vectors for bacterial hosts, and for E. coli in particular. In both mammalian and insect 
cells, plasmid and virus vectors are frequently used to obtain expression of exogenous 
DNA. In particular, mammalian cells are commonly transformed with conventional viral 
vectors, or transfected with plasmids, such as the pcDNAI vector series from Invitrogen 
Corporation (San Diego, CA) and the pMAM vector series from Clontech, and insect cells 
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in culture may be transformed with baculovirus expression vectors. Yeast vector systems 
include yeast centromere plasmids, yeast episomal plasmids and yeast integrating 
plasmids. The invention encompasses any and all host cells transformed or transfected by 
the claimed nucleic acid sequences or fragments thereof, as well as expression vectors 
used to achieve this. 

In preferred embodiments, the transformed host cells are yeast. A variety 
of yeast cultures, and suitable expression vectors for transforming yeast cells, are known. 
See e.g., U.S. Patent No. 4,745,057; U.S. Patent No. 4,797,359; U.S. Patent No. 
4,615,974; U.S. Patent No. 4,880,734; U.S. Patent No. 4,711,844; and U.S. Patent No. 
4,865,989. Saccharomyces cerevisiae is the most commonly used among the yeasts, 
although a number of other yeast species are commonly available. See, e.g., U.S. Patent 
No. 4,806,472 (Kluveromyces lactis and expression vectors therefore); 4,855,231 (Pichia 
pastoris and expression vectors therefore). A heterologous potassium channel may permit 
a yeast strain unable to grow in medium containing low potassium concentration to 
survive [CY 162, for example, see J. A. Anderson et aL, Proc. Natl. Acad. Sci. USA 89, 
3736-3740 (1992)]. Yeast vectors may contain an origin of replication from the 
endogenous 2 micron (2|i) yeast plasmid or an autonomously replicating sequence (ARS) 
which confer on the plasmid the ability to replicate at high copy number in the yeast cell, 
centromeric (CEN) sequences which limit the ability of the plasmid to replicate at only 
low copy number in the yeast cell, a promoter, DNA encoding the heterologous DNA 
sequences, sequences for polyadenylation and transcription termination, and a selectable 
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marker gene. An exemplary plasmid is Yrp7, [Stinchcomb et aL, Nature 282 , 39 (1979); 
Kingsman et al, Gene 7, 141 (1979); Tschemper et aL, Gene 10, 157 (1980)]. This 
plasmid contains the TRP1 gene, which provides a selectable marker for a mutant strain of 
yeast lacking the ability to grow in the absence tryptophan, for example ATCC No. 
44076. The presence of the trpl lesion in the yeast host cell genome then provides an 
effective environment for detecting transformation by growth in the absence of 
tryptophan. 

Suitable promoting sequences in yeast vectors include the promoters for 
metallothionein (Yep52), 3 -phosphogly cerate kinase [pPGKH, Hitzeman et al., J. Biol. 
Chem. 255, 2073 (1980)] or other glycolytic enzymes [pYSK153, Hess et aL, J. Adv. 
Enzyme Reg. 7, 149 (1968)]; and Holland et aL, Biochemistry 17, 4900 (1978)], such as 
enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosoph-fructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, phosphoglucost isomerase, and glucokinase. 
Suitable vectors and promoters for use in yeast expression are further described in R. 
Hitzeman et aL , EPO Publn. No. 73,657. Other promoters, which have the additional 
advantage of transcription controlled by growth conditions, are the promoter regions for 
alcohol dehydrogenase 2 (pAD4M), isocytochrome C, acid phosphates, degradative 
enzymes associated with nitrogen metabolism, and the aforementioned metallothionein 
and glyceraldehyde-3-phosphate dehydrogenase, as well as enzymes responsible for 
maltose and galactose (pYES2) utilization. Finally, in constructing suitable expression 
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plasmids, the termination sequences associated with these genes may also be ligated into 
the expression vector 3' of the heterologous coding sequences to provide polyadenylation 
and termination of the mRNA. 

In certain embodiments, the nucleic acid sequences of the invention are 
used to express proteins in a bacterial host. Protein expressed in bacteria can be used in 
raising antisera (both polyclonal and monoclonal) by standard methodology. Such 
antibodies are useful in immunohistochemical studies to determine the level of expression 
of the channel protein in various tissues and cell lines. The channel can be purified from 
bacterial cells if found in inclusion bodies, for example, by isolation of inclusion bodies 
by standard techniques, followed by electrophoresis in SDS-P AGE gels and isolation of 
the protein band from the gel. Alternately, the potassium channel proteins, or portions 
thereof, can be expressed as a fusion protein, e.g., with glutathione-s-transferase, or 
maltose binding protein, and then purified by isolation of the protein to which it is fused. 
In additional embodiments of the invention, the predicted amino acid sequence can be 
used to design synthetic peptides unique to the potassium channels as herein described, 
which peptides can then be used to raise antibodies to the channels. 

The present invention further provides methods of identifying cells or 
tissues which express a member of the family of channel proteins presented herein. For 
example, a probe comprising a DNA sequence of hORKl, a fragment thereof, or a DNA 
sequence encoding another member of the hORKl family of channel proteins can be used 
as a probe or amplification primer to detect cells which express a message homologous to 
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the probe or primer. One skilled in the art can readily adapt currently available nucleic 
acid amplification or detection techniques so that it employs probes or primers based on 
the sequences encoding a member of this family. 

The materials for use in these embodiments are ideally suited for the 
preparation of a kit. Specifically, a kit is provided, which is compartmentalized to receive 
in close confinement, one or more containers which comprises: (a) a first container 
comprising one or more probes or amplification primers based on the hORK sequence or 
any of the other sequences, or simply a fragment containing nucleic acids that encode 
ZXXZ 1 Z 2 Z 3 GXG and Z 4 X 1 X 2 X3GX 4 PX 5 ; and (b) one or more other containers 
comprising one or more of the following: a sample reservoir, wash reagents, reagents 
capable of detecting presence of bound probe from the first container, or reagents capable 
of amplifying sequences hybridizing to the amplification primers. 

A compartmentalized kit includes any kit in which reagents are contained 
in separate containers. Such containers include small glass containers, plastic containers 
or strips of plastic or paper. Such containers allow one to efficiently transfer reagents 
from one compartment to another compartment such that the samples and reagents are not 
cross-contaminated and the agents or solutions of each container can be added in a 
quantitative fashion from one compartment to another. Such containers will include a 
container which will accept the test sample, a container which contains the probe or 
primers used in the assay, containers which contain wash reagents (such as phosphate 
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buffered saline, Tris buffers, etc.), and containers which contain the reagents used to 
detect the bound probe or amplified product. 

Types of detection reagents include labeled secondary probes, or in the 
alternative, if the primary probe is labeled, the enzymatic, or antibody binding reagents 
which are capable of reacting with the labeled probe. One skilled in the art will readily 
recognize that probes and amplification primers based on the sequence disclosed in the 
present invention can be readily incorporated into one of the established kit formats which 
are well known in the art. 

The sequences of the present invention are also valuable for chromosome 
identification. The sequence may be specifically targeted to and hybridize with a 
particular location on an individual chromosome, for example, the human chromosome. 
Moreover, there is a current need for identifying particular sites on the chromosome. Few 
chromosome marking reagents based on actual sequence data (repeat polymorphisms) are 
presently available for marking chromosomal location. The mapping of DNA to 
chromosomes according to the present invention is an important first step in correlating 
those sequences with genes associated with disease, or tracking other possible disease 
pathways. 

Briefly, sequences can be mapped to chromosomes by preparing PCR 
primers (preferably 1 5-25 bp) from the cDNA. Computer analysis of the cDNA is used to 
rapidly select primers that do not span more than one exon in the genomic DNA, thus 
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complicating the amplification process. These primers are then used for PCR screening of 
somatic cell hybrids containing individual chromosomes. Only those hybrids containing 
the gene corresponding to the primer will yield an amplified fragment. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a 
particular DNA to a particular chromosome. Using the present invention with the same 
oligonucleotide primers, sublocalization can be achieved with panels of fragments from 
specific chromosomes or pools of large genomic clones in an analogous manner. Other 
mapping strategies that can similarly be used to map to its chromosome include in situ 
hybridization, prescreening with labeled flow-sorted chromosomes and preselection by 
hybridization to construct chromosome specific-cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a cDNA clones to a 
metaphase chromosomal spread can be used to provide a precise chromosomal location in 
one step. This technique can be used with cDNA as short as 500 or 600 bases; however, 
clones larger than 2,000 bp have a higher likelihood of binding to a unique chromosomal 
location with sufficient signal intensity for simple detection. FISH requires use of the 
large clones from which the cDNA was derived, and the longer the better. For example, 
2,000 bp is good, 4,000 is better, and more than 4,000 is probably not necessary to get 
good results a reasonable percentage of the time. 

Once a sequence has been mapped to a precise chromosomal location, the 
physical position of the sequence on the chromosome can be correlated with genetic map 
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data. Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man 
(available on line through Johns Hopkins University Welch Medical Library). The 
relationship between genes and diseases that have been mapped to the same chromosomal 
region are then identified through linkage analysis (coinheritance of physically adjacent 
genes). 

Next, it is necessary to determine the differences in the cDNA or genomic 
sequence between affected and unaffected individuals. If a mutation is observed in some 
or all of the affected individuals but not in any normal individuals, then the mutation is 
likely to be the causative agent of the disease. 

With current resolution of physical mapping and genetic mapping 
techniques, a cDNA precisely localized to a chromosomal region associated with the 
disease could be one of between 50 and 500 potential causative genes. (This assumes 1 
megabase mapping resolution and one gene per 20 kb). 

Comparison of affected and unaffected individuals generally involves first 
looking for structural alterations in the chromosomes, such as deletions or translocations 
that are visible from chromosome spreads or detectable using PCR based on that cDNA 
sequence. Ultimately, complete sequencing of genes from several individuals is required 
to confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

In yet another embodiment of the present invention, a yeast expression 
system is described, wherein yeast cells bear heterologous potassium channels. Cloning 
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and expression of potassium channels from heterologous species such as those described 
herein are useful in the discovery of new pesticides, and animal and human therapeutics. 
Discovery of such compounds will necessarily require screening assays of high specificity 
and throughput. For example, new pesticides directed at potassium channels require high 
selectivity for insect channels and low activity against non-insect species. Screening 
assays utilizing yeast strains genetically modified to accommodate functional expression 
of heterologous potassium channels offer significant advantages in this area. In preferred 
embodiments, these channels expressed in heterologous yeast cells are dORK, RAK (as 
described below), Shal, Shaw, Eag, cORK, or hORKl. As noted above, transformed host 
cells of the present invention express the proteins or protein subunits coded for by the het- 
erologous DNA sequences. When expressed, the potassium channel is located in the host 
cell membrane (i.e., physically positioned therein in proper orientation for both the 
stereoselective binding of ligands and passage of potassium ions). In other preferred 
screening embodiments of the present invention, the potassium channel is positioned 
within a cell membrane in such a manner as to allow it to function as a modulator of the 
flow of potassium ions into and out of the cell. To best regulate this activity, at least one 
pore-forming domain may be positioned proximal to a exterior portion of the cell 
membrane. Thus, in certain preferred screening embodiments of the present invention, a 
transformed yeast cell is presented, containing a heterologous DNA sequence which codes 
for a potassium channel, as herein presented, cloned into a suitable expression vector. 
Various other useful potassium channels may be utilized in the screening assay 
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embodiments of the present invention, such as a delayed rectifier potassium channel 
referred to as "RAK or RATRAK" [Paulmichl et al , Proc. Natl. Acad. Sci, USA 88, 
7892-7895 (1991), reporting the cloning of this potassium channel from rat cardiac 
tissue.] RAK is capable of complementing the potassium-dependent phenotype of 
Saccharomyces cerevisiae strain CY162 on medium containing low potassium 
concentration. 

Using the purified proteins, or polypeptide sequences of the invention, the 
present invention provides methods of obtaining and identifying agents capable of binding 
to or otherwise interacting with the potassium channels of the invention. 

In detail, said method comprises: 

(a) contacting a substance with a select member of the family of 
potassium channels or select channel peptides or proteins; and 

(b) determining whether the substance interacts with said channel, 
peptide, or protein. 

The screened substances in the above assay can be, but are not limited to, 
proteins, peptides, peptidomimetics, carbohydrates, vitamin derivatives, compounds, or 
other pharmaceutical agents or any mixtures thereof. The substances can be selected and 
screened at random or rationally selected or designed using protein modeling techniques. 
As used herein, a substance is said to be "rationally selected or designed" when the 
substance is chosen based on the configuration of the particular member of the claimed 
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family of channel proteins. For example, one skilled in the art can readily adapt currently 
available procedures to generate peptides, pharmaceutical agents and the like capable of 
binding to a specific peptide sequence in order to generate rationally designed antipeptide 
peptides, for example see Hurby et aL, "Application of Synthetic Peptides: Antisense 
Peptides/' In Synthetic Peptides, A User's Guide, W. H. Freeman, N.Y., 289-307 (1992), 
and Kaspczak et a/., Biochemistry 28, 9230-8 (1989). Pharmaceutical agents and the like 
may be similarly generated using techniques known to the art. 

The present invention further provides methods for modulating the 
expression of hORK, or a member of the hORK family of channel proteins. Specifically, 
anti-sense RNA expression is used to disrupt the translation of the mRNA encoding the 
hORK protein. 

In detail, a cell is modified using routine procedures such that if expresses 
an antisense mRNA, an mRNA which is complementary to mRNA encoding the hORK 
family member. By constitutively or inducibly expressing the antisense RNA, the 
translation of the hORK family member mRNA can be regulated. 

In certain preferred embodiments, the cloning of the members 
disclosed herein now makes possible the screening capability which enables the 
identification of agonists (potassium channel openers) and antagonists (potassium 
channel closers) of this family of channel proteins. The two-pore K + channels 
described herein in humans can be used as targets for novel human therapeutics. The 
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primary target for such therapeutic agents will be conditions related to alterations in 
the plasma membrane resting potential and/or the duration of the action potential in 
excitable cells. Potassium channels influence action waveforms and firing frequency 
of cells and therefore play a role in neuronal integration muscle contraction, and 
hormone secretion in excitable cells. Potassium channels play the vital role of 
determining resting electrical membrane potential by setting membrane permeability 
to potassium ions in the cell. Inward conductance at membrane potentials below K + 
equilibrium potential (E^) prevents excessive hyperpolarization which may be caused 
by the electrogenic Na + pump; the slight outward conductance of inward rectifier K + 
channels at membrane potentials just above K + equilibrium helps to keep the resting 
membrane potential close to E k . Modulation of the conductance level of potassium 
channels changes the resting potential and alters the excitability of a cell; i.e. the 
activation of a particular type of inward rectifier K + channel has been shown to cause 
hyperpolarization of the cardiac pacemaker cells and slows the heartbeat. Thus, 
modulation of potassium channels can occur when one provides to cells, agents 
capable of binding to the potassium channel proteins. 

In the cardiovascular area, this class of potassium channels may be of 
use in the discovery of new agents for the treatment of atrial and ventricular 
arrhythmias, heart failure including associated arrhythmias and cardiac ischemia. The 
action of such agents would be effected through the modulation of the kinetics 
duration of the cardiac action potential. 
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Modulation of cardiac action potential by compounds that effect the 
behavior of potassium channels may be a useful treatment for serious heart conditions. 
The delayed rectifier potassium current in heart cells regulates the duration of the 
plateau of the cardiac action potential by countering the depolarizing, inward calcium 
current. Delayed rectifier potassium currents characteristically are activated upon 
depolarization from rest, display a sigmoidal or delayed onset, and have a nonlinear, 
or rectifying, current-voltage relationship. Several types of delayed potassium 
conductances have been identified in cardiac cells based on measured single-channel 
conductances. Heart-rate and contractility are regulated by second messenger modi- 
fication of delayed rectifier potassium conductances, and species differences in the 
shape of the plateau may be influenced by the type and level of channel expression. 

Potassium channel openers may also function as smooth muscle relaxants, functioning as 
vasodilators, vasospasmolytics, and other smooth muscle spasmolytic. As vasodilators, 
these compounds have use as dilators of peripheral vasculature, coronary arteries, renal 
vasculature, cerebral vasculature, and mesenteric vasculature. As vasospasmolytics, these 
compounds have use in the treatment of coronary artery spasm, peripheral vascular spasm, 
cerebral vascular spasm and impotence. Other smooth muscle spasmolytics have use as 
bronchodilators, in the control of urinary bladder and gall bladder spasm, and in the 
control of esophageal, gastric, and intestinal smooth muscle spasm. 

Potassium channel closers may function in the pancreas to enhance release 
of insulin, in the kidney as diuretics and renal epithelial anti-ischemic agents, as 
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hypertensive agents for promoting vasoconstriction for use in hypotensive states as 
antiarrhythmic agents, and as agents for modifying cardiac muscle contractility. 

Other uses for potassium channel agonists or antagonists include 
anticonfulsants, hair growth promoting agents, and agents effective in preventing or 
reducing skeletal muscle damage or fatigue. 

Thus, in yet further preferred embodiments, methods of modulating 
cellular activity to provide theraeutic value are provided, by applying to a patient in need 
of such modulation, a substance capable of interacting with a potassium channel contained 
in the relevant cells of such patient and modulating the activity of same (a good example 
of which are cardiac cells, useful for cardiac modulation purposes). These aspects of the 
present invention relate to methods of modulating potassium channel activity, by affecting 
the ability of such channel to allow the flow of ions into, through, or out of a cellular 
membrane, and particularly when these ions are potassium ions. Certain substances 
whether biological or chemical in nature, may be applied to cell membranes having as an 
integral part of their structure, one or more potassium channels as presented herein, and 
particularly those comprising the amino acid sequences of SEQ ID NO: 2, SEQ ID NO: 4, 
SEQ ID NO: 36, SEQ ID NO: 46, or RAK, in an amount and for a time sufficient to affect 
the ability of the potassium channel to so regulate the flow of ions. Substances that are 
potassium channel blockers will inhibit the ability of the channel to regulate the flow of 
such ions. Substances that enhance such ability may be considered potassium channel 
"activators." 
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Application of such substances may take the form of in vitro, ex vivo, or in 
vivo application, each in a formulation suitable to deliver the substance to the cell 
membrane and to sustain such delivery for a time sufficient to allow the substance to 
interact with the membrane. Appropriate formulations, concentrations of substances, 
application time, and other relevant parameters may be established by utilizing, inter alia, 
known assays for measuring ion channel current flow. Such compositions may comprise 
conventional delivery/carrier systems, e.g., liposome or phospholipid encapsulation, water 
or saline solutions, polymeric compositions, and the like. Another suitable endpoint one 
skilled in the art may utilize in optimizing these parameters, especially in the case of 
potassium channel blockers, is "cell death". Such assays may be performed in vitro and 
extrapolated to in vivo conditions, or in some cases may be easily established directly in 
vivo the field of insecticides is instructive for this purpose. For example, by applying the 
substance directly to a test sample comprising the target insect pest (whole organism) and 
noting the appropriate parameters at which an acceptable per cent of insect death is 
attained. 

In certain other preferred embodiments, methods of selectively inhibiting 
insect pests are presented by applying to such insect pests a substance capable of 
selectively inhibiting the activity of a potassium channel contained in the cells of such 
insect, and comprising the amino acid sequence of SEQ ID NO:2, or a potassium channel 
biologically equivalent thereto. In the most preferred embodiments, the inhibitor will 
inhibit the activity of the aforementioned potassium channel without inhibition of other, 
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non-homologous or otherwise non-equivalent potassium channels that may be present in 
species other than the targeted insect pest. It is envisioned that such other species may 
also be present at the site of application of the inhibitor, such as in a garden, crop, or other 
site wherein it is desired to control insect pests. In other preferred embodiments, methods 
of selectively inhibiting nematode pests are presented much in the same manner as 
discussed for control of insect pests, by applying to such pests a substance capable of 
selectively inhibiting the activity of a potassium channel contained in the cells of such 
pest, said potassium channel comprising the the amino acid sequence of SEQ ID NO:4, 
SEQ ID NO: 36, or potassium channels biologically equivalent thereto. 

The present invention further provides methods for generating chimeric or 
transgenic animals 1) in which the animal contains one or more exogenously supplied 
genes which are expressed in the same temporal and spatial manner as a member of the 
family of channel proteins as presented herein, or 2) in which such member of this family 
of channel proteins has been deleted or overexpressed. Such chimeric and transgenic 
animals are useful in the further elucidation of the mechanisms of potassium channel 
function as well as their effect an animal physiology. These transgenic and chimeric 
animals are produced by utilization of techniques which are well known and well 
described in the technical literature, e.g., see U.S. Patent No. 5,434,340 and scientific 
references cited therein discussing inter alia, the introduction of transgenes into the 
gumline of a non-human animal, herein incorporated by reference. 
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The following Examples are provided to further illustrate various aspects 
of the present invention. They are not to be construed as limiting the invention. 



EXAMPLE 1 

Using the yeast expression technology and other teachings as set forth 
herein, the present inventors have isolated a single 2463 base pair cDNA fragment from 
an invertebrate source, designated Dm ORF1 [SEQ ID NO: 1], by complementation of the 
potassium-dependent phenotype of Saccharomyces cerevisiae strain CY162 (trklA) on 
medium containing low potassium concentration [J. A Anderson et al y Proc. Natl. Acad. 
Sci USA 89, 3736-3740 (1992)]. Dm ORF1 contains a single long open reading frame 
encoding a protein of 618 amino acids [SEQ ID NO:2] that exhibits substantial amino acid 
identity to the pore-forming regions of other potassium channels. The DmORFl contains 
structural features that distinguish it from other classes of potassium channels, including 
four hydrophobic domains capable of forming transmembrane helices (M1-M4) and two 
putative pore forming H5 domains found between transmembrane helices Ml and M2, 
and M3 and M4. Each pore forming H5 domain contains the Y/F-G dipeptide motif 
required for potassium selectivity [Heginbotham et ah, Science 258 , 1 152-1 155, (1992)]. 
This work was expanded to clone a construct derived from C. elegans having a single 
open reading frame sufficient to encode a protein of 434 amino acids, designated pCORK. 
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A search of the GENBANK database for DNA and protein sequences 
similar to DmORFl revealed several cloned potassium channel sequences including a 
putative protein coding DNA sequence, F22b7.7, reported in the Caenorhabditis elegans 
genome sequencing project [Wilson et al 9 Nature 368, 32-38 (1994)]. The DNA 
sequence contained a single long open reading frame sufficient to encode a protein of 336 
amino acids (predicted MW 38.5 kDa) with substantial homology to known potassium 
channel sequences. 

Using the hybridization approach, a cDNA sequence designated CeORFl 
[SEQ ID NO: 38] was isolated by probing a Caenorhabditis elegans cDNA library with 
oligonucleotides designed using F22b7.7 DNA sequences [T.N. Davis and J. Thorner 
Meth. Enzymol. 139, 246-262 (1987)]. CeORFl contains a single long open reading 
frame encoding a protein that exhibits substantial amino acid identity to pore-forming 
regions of other potassium channels. DNA sequences encoding a human putative two- 
pore potassium channel were cloned by polymerase chain reaction (PCR) from human 
brain cDNA. Degenerate oligonucleotides (5' and 3* oligo) used in the analysis were 
designed from a compilation of nucleotide sequences encoding the pore-forming domains 
of putative two pore potassium channels identified in a search of the GENBANK DNA 
sequence database. 

CeORFl and pCORK each contain structural features similar to DmORFl, 
including two putative pore forming H5 domains. Each pore forming H5 domain contains 
the Y/F-G dipeptide motif required for potassium selectivity [Heginbotham et aL, Science 
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258 . 1 152-1 155, (1992)]. These features form the basis of the designation of a new sub- 
family of potassium channels comprising DmORFl, CORK, CeORFl, hORK, and 
various other homologs. The particulars of this discovery is set forth in more detail 
below: 



Recombinant expression library screening, 

Saccharomyces cerevisiae strain CY162 is described in Anderson, J.A. et al, Proc. Natl. 
Acad. Sci. USA 89, 3736-3740 (1992)]. Growth of bacterial strains and plasmid manipu- 
lations are performed by standard methods (Maniatis T., Molecular Cloning. Cold Spring 
Harbor Laboratory Press, 1982). Media conditions for growth of yeast, isolation of 
plasmid DNA from yeast, and DNA-mediated transformation of yeast strains are as 
described (Rose M. D., Methods in yeast genetics, Cold Spring Harbor Laboratory Press, 
1990). A multifunctional expression library constructed in pYES2 and containing cDNA 
made from 3rd instar male Drosophila melanogaster mRNA is used as described [SJ. 
Elledge etal, Proc. Natl. Acad. Sci USA 88, 1731-1735 (1991)]. A multifunctional ex- 
pression library constructed in pYES2 and containing cDNA made from mRNA obtained 
from all life stages of Caenorhabditis elegans is custom-made by Invitrogen Corporation. 
Isolation of expression plasmids encoding heterologous pota ssium channels. CY162 
cells are transformed with plasmid DNA from each library to give 3 x 10 6 transformants 
from each library on SCD-ura (synthetic complete dextrose (2 %) medium containing all 
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necessary nutritional supplements except uracil) containing 0.1 M KC1 agar medium. 
Transformants are replica-plated to SCG-ura (synthetic complete galactose (2 %) medium 
containing all necessary nutritional supplements except uracil) agar medium. Colonies 
that grow on this selective agar medium are transferred to SCG-ura agar medium to obtain 
single colonies clones and while reassaying suppression of the potassium-dependent phe- 
notype. Plasmid DNA is isolated from surviving colonies and used to transform CY162. 
Six individual transformant strains containing one plasmid, pDmORFl,that confers the 
potassium independent phenotype is cultured on SCD-ura and SCG-ura medium along 
with CY162 strains bearing pKATl, which encodes a plant inward rectifier potassium 
channel that supports the growth of CY162 on selective medium (FIGURE 1). The 
plasmid bearing strains exhibit potassium-independent growth on both dextrose and 
galactose containing medium. Growth on dextrose is likely due to basal level of tran- 
scription leading to sufficient potassium channel expression to support growth. 



EXAMPLE 2 



DNA sequence analysis of DmORFl. Plasmids that confer suppression of the 
potassium-dependent phenotype are subjected to automated DNA sequence analysis 
performed by high temperature cycle sequencing (Applied Biosystems). Geneworks DNA 
sequence analysis software (Intelligenetics) is used to align raw DNA sequence 
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information and to identify open reading frames. The DNA sequence of the 2.4 kb insert 
in pDmORFl is displayed in FIGURE 2A and 2B [SEQ ID NO:l]. The 5 f untranslated 
sequences of the cDNA contain long poly A and poly T tracts not likely to be found in 
protein coding regions. The first ATG proximal to the 5' end is present in a consensus 
Drosophila melanogaster translational initiation site [D.R. Cavener, Nucleic Acids Res., 
15, 1353-1361 (1987)], consistent with the designation of this site as the translational start 
site. A single long open reading frame sufficient to encode a protein of 618 amino acids 
(predicted MW 68 kDa) is encoded in pDmORFl. A consensus polyadenylation site, 
AATCAA, occurs at position 2093-2098 in 3' untranslated sequences. The DmORFl 
contains structural features that distinguish it from other classes of potassium channels, 
including four hydrophobic domains capable of forming transmembrane helices (M1-M4) 
and two pore forming H5 domains found between transmembrane helices Ml and M2 ? 
and M3 and M4. Each pore forming H5 domain contains the Y/F-G dipeptide motif 
required for potassium selectivity [Heginbotham et al, Science 258 , 1 152-1 155, (1992)]. 

EXAMPLE 3 

Identification of Caenorhabditis elegans sequences homologous to DmORFl. A 

search of the GENBANK database protein sequences similar to DmORFl reveals 
significant matches with several known potassium channel sequences. The closest match 
is to a putative protein coding DNA sequence, F22b7.7, reported in the Caenorhabditis 
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elegans genome sequencing project [Wilson et aL, Nature 368, 32-38 (1994)]. The DNA 
sequence and predicted amino acid sequence assembled from putative exons recognized 
by a GENBANK exon identification algorithm is displayed in FIGURE 3A and 3B [SEQ 
ID NOS:3 and 4]. The DNA sequence contains a single long open reading frame 
sufficient to encode a protein of 336 amino acids (predicted MW 38.5 kDa) with 
substantial homology to known potassium channel sequences. The F22b7.7 sequence 
contains structural features that distinguish it from other classes of potassium channels, 
including three of four hydrophobic domains capable of forming transmembrane helices 
(M1-M4) identified in DmORFl and two pore forming H5 domains found between 
transmembrane helices a predicted Ml and M2 3 and M3 and M4. Each pore forming H5 
domain contains the Y/F-G dipeptide motif required for potassium selectivity 
[Heginbotham et al 9 Science 258 . 1 152-1 155, (1992)]. The lack of an amino terminal 
transmembrane domain homologous to DmORFl Ml in the F22b7.7 sequence may be 
due to failure of the search algorithm to identify exon(s) encoding the amino terminus. 
Alternatively, an amino terminal coding sequence may be added by trans-splicing, which 
occurs frequently in Caenorhabditis elegans. 



EXAMPLE 4 
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Clonin g and DNA sequence analysis of CeORFl. Oligonucleotides corresponding to 
DNA sequences encoding the two pore forming domains of F22b7.7 are synthesized using 
an Applied Biosystems DNA synthesizer. 
F22b7.7-H2-1: 

5TCCATTTTCTTTGCCGTAACCGTCGTCACTACCATCGGATACGGTAATCCA 
[SEQ ID NO:5]. F22b7.7-H2-2: 

5TCATTCTACTGGTCCTTCATTACAATGACTACTGTCGGGTTTGGCGACTTG 
[SEQ ID NO:6]. The oligos were labelled at their 5' ends with 32 P using a 5 ? -end labelling 
kit according to manufacturers instructions (New England Nuclear). The labelled oligos 
are pooled and used to screen 6 x 10 5 plaques from a XZAP-Caenorhabditis elegans 
cDNA library (obtained from Clontech) by published methods [T.N. Davis and J. Thorner 
Meth. Enzymol. 139, 246-262 (1987)]. Hybridization is at 42°C for 16 hours. Positive 
clones are plaque-purified by twice repeating the hybridization screening process. Plas- 
mid DNAs, excised from phage DNA according to manufacturers instructions, are 
subjected to automated DNA sequence analysis performed by high temperature cycle 
sequencing (Applied Biosystems). Geneworks DNA sequence analysis software 
(Intelligenetics) is used to align raw DNA sequence data and to identify open reading 
frames. 



EXAMPLE 5 
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Comparison of the putative proteins encoded by DmOR Fl and F22h7.7. Predicted 
amino acid sequences of DmORFl and F22b7.7 are aligned and displayed in FIGURE 4 
[SEQ ID NOS:37 and 38]. Only limited overall amino acid homology is exhibited by 
these two proteins with regions of greatest homology existing in the pore forming H2-1 
and H2-2 domains. FIGURE 5A shows a comparison of the pore forming domains of 
DmORFl and F22b7.7 with those of the known Drosophila melanogaster potassium 
channel and inward rectifier sequences [SEQ ID NOS:7 through 21]. Amino acid 
identities greater than 50 % are observed with all potassium channel sequences. FIGURE 
5B shows hydropathy plot analysis of DmORFl and F22b7.7. The two proteins, which 
show remarkable topological similiarity through their length, are predicted to be com- 
posed of four membrane-spanning hydrophobic domains (M1-M4), and two pore forming 
H2 domains. These data suggest the predicted topology shown in FIGURE 6. Both 
proteins are predicted to span the membrane four times with amino and carboxyl termini 
residing within the cell. This topology places the single amino-terminal asparagine-linked 
glycosylation site and H2 domains on the cell exterior permitting permeation of the 
membrane by the pore forming domains from the outside, an absolute requirement for the 
formation of a functional potassium channel. 



EXAMPLE 6 
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Functional expression of a rat atrial delayed rectifier potassium channel in yeast 

CY162 transformants containing plasmids pKATl , which encodes a plant inward rectifier 
potassium channel, pRATRAK, which encodes a rat atrial delayed rectifier potassium 
channel, pDmORFl, and control plasmid pYES are cultured on arginine-phosphate- 
dextrose agar medium lacking ura medium [A. Rodriguez-Navarro and J. Ramos, J. 
Bacteriol. 159, 940-945, (1984)] containing various KC1 concentrations (FIGURE 7). 
Strains containing pKATl, pRATRAK, and pDmORFl all support the growth of CY162 
on medium containing a low concentration of potassium, while pYES2 containing CY162 
cells only grow on medium containing a high potassium concentration, indicating that 
heterologous potassium channels of several different types function to provide high 
affinity potassium uptake. 

pRATRAK is constructed by modifying the protein-coding sequences of 
RATRAK to add 5' Hindlll and 3' Xbal sites using PCR. In addition, four A residues are 
added to the sequences immediately 5 1 proximal to the initiator ATG to provide a good 
yeast translational initiation site. The modified fragment is cloned into the Hindlll and 
Xbal sites in the yeast expression vector pYES2 (Invitrogen), forming pRATRAK. 



EXAMPLE 7 
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Bioassay of functional expression of heterologous potassium channels. 

Yeast strains dependent on heterologous potassium channels for growth 
should be sensitive to non-specific potassium channel blocking compounds. To test the 
potassium channel blocking properties of several compounds, a convenient agar plate 
bioassay is employed. Strains containing pKATl, pRATRAK, pDmORFl, and pYES2 
are plated in arginine-phosphate-dextrose agar medium lacking ura and containing various 
amounts of potassium chloride. Arginine-phosphate-dextrose medium is used to avoid 
interference from potassium and ammonium ions present in standard synthetic yeast cul- 
ture medium. Sterile filter disks were placed on the surface of the agar and saturated with 
potassium channel blocking ions CsCl, BaCl 2 , and TEA. The growth of heterologous 
potassium channel containing strains is inhibited by potassium channel blocking ions, in a 
channel dependent manner. DmORFl -dependent growth is blocked by BaCl 2 but not by 
CsCl or TEA. KAT-dependent growth is blocked by BaCl 2? CsCl and TEA. RATRAK- 
dependent growth is blocked by BaCl 2 , CsCl and TEA to a much greater extent than 
pKATl , reflecting in part a slower growth rate of pRATRAK-containing cells. These 
observations confirm that these channels support the growth of the mutant yeast cells and 
demonstrate the efficacy of the yeast bioassay for screening for compounds that block 
potassium channel function. The control pYES-containing strain grows only around 
applied KC1 and RbCl, a congener of KCL 
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EXAMPLE 8 



Identification of compounds that alter potassium channel activity. 

Yeast strains made capable of growing on medium containing low 
potassium concentration by expression of heterologous potassium channels are used to 
screen libraries of chemical compounds of diverse structure for those that interfere with 
channel function. CY162 cells containing pKATl, pRATRAK, pDmORFl, pCeORFl, 
and pYES2-TRKl (10 4 /ml) are plated in 200 ml of arginine-phosphate-dextrose agar 
medium lacking ura and containing 0.2 mM potassium chloride in 500 cm 2 plates. The 
CY162 cells bearing pYES2-TRKl are included in the assay as a control to identify 
compounds that have non-specific effects on the yeast strain and are therefore not 
specifically active against the heterologous potassium channels. Samples of chemical 
compounds of diverse structure (2 pi of 10 mg/ml solution in DMSO) are applied to the 
surface of the hardened agar medium in a 24 x 24 array. The plates are incubated for 2 
days at 30°C during which time the applied compounds radially diffuse into the agar 
medium. The effects of applied compounds on strains bearing heterologous potassium 
channel genes are compared to the pYES2-TRKl bearing strain. Compounds that cause a 
zone of growth inhibition around the point of application that is larger on plates 
containing cells bearing the heterologous potassium channels than that observed around 
the pYES2-TRKl bearing strains are considered selective potassium channel blockers. 
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growth around the point of application that is 
heterologous potassium channels than that 
strains are considered selective potassium 



EXAMPLE 9 



DmORFl-induced currents inX laevis oocytes assaye d by two-electrode voltage 
clamp. 

DNA sequence analysis of the pDmORFl insert strongly suggest that the 
protein encoded by the single long ORF possesses properties in common with known 
potassium channels. To test this hypothesis, the electrophysiological properties of the 
putative potassium channel encoded by DmORFl was examined by expression mX. 
laevis oocytes. Currents were measured by two-electrode whole-cell voltage clamp. 
DNA sequences encoding the open reading frame of DmORFl were amplified by 
polymerase chain reaction (PCR) using the following oligonucleotides: 
MP023: ATAAAGCTTAAAAATGTCGCCGAATCGATGGAT [SEQ ID NO:22] 
MP024: AGCTCTAGACCTCCATCTGGAAGCCCATGT [SEQ ID NO:23] 
The full length PCR product was cloned into corresponding sites in pSP64 poly A 
(Promega), forming pMP147. Template DNA was linearized with EcoRI and RNA 
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transcribed using the Message Machine (Ambion) in vitro transcription kit according to 
manufacturers instructions. A sample of the RNA was resolved in a MOPS-acetate- 
formaldehyde agarose gel and RNA content was estimated by ethidium bromide staining. 
The remainder was stored on dry ice. X laevis oocytes were isolated and injected with 50 
nl of sterile TE containing 5-20 ng transcript according to published procedures. After 
three days, whole oocyte currents were recorded using a two-electrode voltage clamp. 
Electrodes contained 3M KC1 and had resistances of 0.3-1 .0 MW. Recordings were 
performed with constant perfusion at room temperature in the presence of either low (10 
mM) or high (90 mM) potassium chloride. Two electrode voltage clamp analysis of the 
DmORFl gene product expressed in X laevis oocytes demonstrates properties of a 
voltage- and potassium-dependent potassium channel. At low potassium concentrations, 
DmORFl exhibited outward current at depolarizing potentials. At high potassium 
concentration, DmORFl exhibits both inward and outward currents. The DmORFl 
channel displays a high preference for potassium and shows cation selectivity in the rank 
order K>Rb>NH 4 >Cs>Na>Li. Potassium currents were greatly attenuated by BaCl 2 . 

EXAMPLE 10 

Developmental regulation of DmORFl expression in ZX melan ogaster determined by 
northern blotting analysis. 
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Isolation of pDmORFl from a D. melanogaster expression library strongly 
suggests that the insert contained within originated in mRNA from that species. Detailed 
understanding of the developmental regulation of DmORFl expression should aid in 
determining strategies for use of DmORFl as a target for novel insecticides. To 
characterize DmORFl expression, northern blotting analysis of poly A RNA from various 
stages of the D. melanogaster life cycle was carried out. 

D. melanogaster poly A+ RNA from embryo, larvae and adult forms 
(Invitrogen, 5 mg) was resolved in a MOPS -acetate-formaldehyde agarose gel according 
to standard procedures. The gel was stained with ethidium bromide and photographed to 
mark the positions of 18 S and 28 S ribosomal RNAs used as molecular weight markers. 
RNA was transferred by capillary action to nitrocellulose with 10 x SSPE. The blot was 
air-dried, baked for one hour at 80°C, and prehybridized in 4x SSPE, 1% SDS, 2x 
Denhardt's, 0.1 % single stranded DNA at 68°C for 2 hours. 

A 2.4 kb Xhol fragment of DmORFl was isolated from pDmORFl and 
labeled with a- 32 P dCTP using the Ready-to-Go kit (Pharmacia) according to 
manufacturers instructions. The probe was denatured by heating to 100°C for 5 minutes 
followed by quenching in an ice water bath. The probe was added to the prehybridization 
solution and hybridization continued for 24 hours at 68°C. 

The blot was washed briefly with 2x SSPE, 0.1% SDS at room temperature 
followed by 0.5 x SSPE, 0.1 % SDS at 65°C for 2 hours. The blot was air-dried and 
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exposed to Reflection X-ray film (NEN) using an intensifying screen at -70°C for 48 
hours. 

Northern blotting analysis indicates that the DmORFl probe hybridizes to 
an mRNA species of approximately 2.8 kb isolated from D, melanogaster embryo, larvae, 
and adult forms. The length of the DmORFl mRNA corresponds well with the length of 
the predicted ORF. Thus, the DmORF is expressed at all developmental stages in the life 
cycle of D. melanogaster. 



ff EXAMPLE 11 

H Expression of the DmORFl gene product in vitro. 

Lk DNA sequence analysis of the pDmORFl insert reveals a single long ORF 

. ^ 

Si with conserved amino acid sequence domains in common with known potassium 

channels. The DNA sequence predicts an ORF sufficient to encode a protein of 618 amino 
acid in length. The DmORFl polypeptide contains four segments of at least 20 
hydrophobic amino acids in length suggesting that the segments span the plasma 
membrane. In addition, the DmORFl protein sequence contains a putative N-linked 
glycosylation site (Asn-Thr-Thr) at amino acids 58-60. To confirm that a protein of the 
predicted size of DmORF is expressed from the insert in pDmORFl and to test the 
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proposition that DmORFl is glycosylated, pDmORFl was used as template to drive 
coupled in vitro transcription/translation. 

Plasmid pMP147 was used as template to produce 35 S-labeled DmORFl 
gene product in vitro using a TnT coupled transcription-translation kit (Promega) 
according to manufacturers instructions. Glycosylation of the nascent DmORFl poly- 
peptide was accomplished by addition of canine pancreatic microsomes (Promega) to the 
transcription-translation reaction. Samples of glycosylated DmORF protein were treated 
with endoglycosidase H to remove added carbohydrate moieties. Aliquots were 
precipitated with TCA and collected on GF/C filters, washed with ethanol, dried and 
counted. Equivalent cpm's were resolved by SDS-PAGE. The gel was impregnated with 
soluble fluor Amplify (Amersham) and dried onto Whatman 3 MM paper. The dried gel 
was exposed to Reflection X-ray film at room temperature. 

Translation of the DmORFl gene product in vitro produced a polypeptide 
of 68 kDa, consistent with the predicted molecular weight of the ORF. Translation of 
DmORFl in the presence of canine pancreatic microsomes results in synthesis of a protein 
with reduced electrophoretic mobility, consistent with glycosylation of the nascent 
polypeptide. Treatment of glycosylated DmORF with EndoH increased its relative 
mobility as expected upon removal of carbohydrate moieties. Thus, the pDmORFl insert 
is capable of directing the expression of a glycoprotein with the expected molecular 
weight. EndoH treatment removes carbohydrate residues consistent with the sugar added 
through N-linked glycosylation. 
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EXAMPLE 12 

High-affinity K + uptake and selectivity of DmORFl expressed in yeast. 

Expression of DmORF permits CY162 cells to grow on medium 
containing a low concentration of potassium, implying that DmORFl supplies high 
affinity potassium uptake capacity. To characterize the potassium uptake properties of 

86 

CY162 cells containing DmORFl, Rb uptake studies were performed. Examination of 
the uptake of this potassium congener revealed important aspects of potassium uptake by 
DmORFl. 

Yeast strains containing heterologous potassium-expression plasmids 
CY162-DmORFl, CY162-pKAT and the control strain CY162-pYES2 (Clontech) were 
cultured overnight in SC Gal-ura containing 0.1 M KC1. The cells were harvested, 
washed with sterile doubled distilled water and starved for K + for 6 hours in Ca-MES 

o 

buffer. Cells were washed again and distributed to culture tubes (10 cells/tube) 

86 

containing RbCl in Ca-MES buffer. The tubes were incubated at room temperature, 
samples filtered at various time intervals and counted. 86 Rb uptake into cells was 
displayed. 

The high-affinity potassium uptake capacity encoded by DmORFl permits 

86 86 

high-affinity uptake of the potassium congener, Rb, as well. Barium inhibited Rb 
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uptake. No high affinity 86 Rb uptake is observed in control CY162-pYES2 cells and ™Rb 
uptake into CY162-pKAT cells is consistent with its published properties. 



Expression of Drosophila melanogaster potassium channels in yeast. 

Voltage-gated potassium channel diversity in the fruitfly Drosophila 
melanogaster is encoded in large part by six genes, Shaker, Shab, Shal, Shaw, Eag, and 
Slo. Expression of these potassium channels in yeast will permit their introduction into 
screening assays for novel insecticidal compounds and facilitate characterization of their 
ion channel properties and sensitivity to compounds with activating and inhibitory 
properties. 

DNA sequences encoding Drosophila melanogaster potassium channels 
were amplified by PCR using synthetic oligonucleotides that add 5 r Hindlll or Kpn I ? sites 
and y Xbal, SphI, or Xhol sites: 



EXAMPLE 13 



Shaker 5': 



AAAAAGCTTAAAATGGCACACATCACG [SEQ ID NO:24] 



Shaker 3': 



AAACTCGAGTCATACCTGTGGACT [SEQ ID NO:25] 



Shab 5': 



AAAAAGCTTAAAATGGTCGGGCAATTG [SEQ ID NO:26] 
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AAAAGCATGCTCATCTGGATGGGCA [SEQ ID NO:27] 



Shal 5': AAAAAGCTTAAAATGGCCTCGGTCGCC [SEQ ID NO:28] 

Shal 3': TTTTCTAGACTACATCGTTGTCTT [SEQ ID NO:29] 

Shaw 5': AAAAAGCTTAAAATGAATCTGATCAAC [SEQ ID NO:30] 
Shaw 3': AAATCTAGATTAGTCGAAACTGAA [SEQ ID NO:3 1] 

Eag 5': AAAAAGCTTAAAATGCCTGGCGGA [SEQ ID NO:32] 

Eag 3': AAATCTAGAGGCTACAGGAAGTCC [SEQ ID NO:33] 

Slo 5': GGGGGTACCAAAATGTCGGGGTGTGAT [SEQ ID NO:34] 

Slo 3': TTTTTCTAGATCAAGAGTTATCATC [SEQ ID NO:35] 

Plasmids used as templates for the PCR reactions were: pBSc-DShakerH37, pBSc- 
dShabl 1, pBSc-dShal2+(A)36, pBScMXT-dShaw [A. Wei et al, Science 248, 599-603 
(1990), provided by L. SalkoffJ, pBScMXT-slo,v4 [Atkinson et al, Science 253, 551- 
555, (1991), provided by L. SalkoffJ, and pBIMCH20 Eag [CH20] [Warmke et al, 
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Science 252, 1560-1564 (1991), Bruggemann et al 9 Nature 365, 445-448 (1993), 
provided by B. Ganetzky]. 

Amplified fragments were digested with the appropriate restriction 
endonucleases, purified using GeneClean (Bio 101), and ligated into corresponding sites 
in pYES2 (Invitrogen). CY162 cells were transformed with assembled Drosophila 
melanogaster potassium channel expression plasmids by the LiCl method and plated on 
SCD-ura containing 0.1M KC1 agar medium. Selected transformants were tested for 
growth on arginine-phosphate-galactose (2 %)/sucrose (0.2 %)-ura agar medium 
containing 1-5 mM KC1. CY162 cells containing pKATl or pDmORFl were cultured as 
positive controls and CY162 cells containing pYES2 were grown to provide a negative 
control. 

CY162 cells bearing Drosophila melanogaster potassium channel 
expression plasmids survive under conditions in which growth is dependent on functional 
potassium channel expression. At potassium ion concentrations between 1-3 mM, 
negative control CY162 cells containing pYES2 grow poorly. Expression of the 
Drosophila melanogaster potassium channels Shal, Shaw and Eag substantially improve 
growth of CY162. These results are consistent with the Drosophila melanogaster 
potassium channels providing high-affinity potassium uptake capacity. This capacity is 
apparently sufficient to replace the native high-affinity potassium transport capacity 
encoded by TRK1 which is lacking in CY162 (trkl trk2) cells. 
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EXAMPLE 14 



Cloning of a novel C elegans sequence with homology to potassium channels. 

In order to expand the applicability of this technology to discover 
compounds with novel anhelmenthic activity, CY162 cells were transformed with a 
pYES2-based yeast expression library constructed using cDNA synthesized from C. 
elegans mRNA (Invitrogen). Plasmid DNA isolated from yeast cells that survived the 
selection scheme described in EXAMPLE 1 were subjected to automated DNA sequence 
analysis performed by high temperature cycle sequencing (Applied Biosystems). 
Geneworks DNA sequence analysis software (Intelligenetics) is used to align raw DNA 
sequence information and to identify open reading frames. The DNA sequence of the 1 .4 
kb insert in pCORK is displayed in FIGURE 9A and 9B [SEQ ID NO:36]. The 5' 
untranslated sequences of the cDNA are present in this construct. A single long open 
reading frame sufficient to encode a protein of 434 amino acids (predicted MW 48 kDa) is 
predicted in pCORK. A consensus polyadenylation site, AATAAA, occurs at position 
1359-1364 in 3' untranslated sequences and is followed by a tract of 15 consecutive A 
residues. The CORK ORF contains structural features that resemble pore forming H5 
domains found in potassium channels. Two putative pore forming H5 domains (residues 



-59- 



32,421-C2 



Express Mail No. TB663627987US 



76-39 and 150-162) contain the G-Y/F-G tripeptide motif required for potassium 
selectivity [Heginbotham et ai 9 Science 258, 1152-1155, (1992)]. 

EXAMPLE 15 

Cloning of the Human Two-Pore Potassium Channel Sequence: hORKL 

Materials and Methods 

DNA sequences encoding a human putative two-pore potassium channel were cloned by 
polymerase chain reaction (PCR) from human brain cDNA. Degenerate oligonucleotides 
(5 f and 3' oligo) used in the analysis were designed from a compilation of nucleotide 
sequences encoding the pore-forming domains of putative two pore potassium channels 
identified in a search of the GENBANK DNA sequence database. 

Oligos used in degenerate PCR cloning approach 

5' oligo: 5' TIG GAT (AT)(CT)G G(AT)G A(CT)(AT) T [SEQ ID NO:39] 

3' oligo: 5' (AG)TC (AT)CC (AG)(AT)A (ACT)CC (AGT)A(CT) (AGT)GT [SEQ ID 
NO:40] 

Clontech QUICK-Clone human brain cDNA was used as template (1 ng cDNA in 20 jjlI 
reaction) in a reaction mixture containing 1.25 U AmpliTaq DNA Polymerase (Perkin- 
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Elmer), 1 [iM primers, 200 \xM dNTPs. PCR was carried out by standard procedures using 
the cycles given below in a Perkin-Elmer 9600 thermocycler. 

PCR: 94°2' 1 cycle 

94°30" 

48°30 M 35 cycles 

60" ramp to 72° 

72°30" 

72° 10' 

The resulting PCR fragments were cloned into the Invitrogen TA cloning kit according to 
manufacturers instructions. The cloned DNA fragments were sequenced with ABI Ready 
Reaction DyeDeoxy Terminator Cycle Sequencing Kit on the ABI373 Automated DNA 
sequencer according to manufacturers instructions. One fragment contained a 339 base pair 
(bp) open reading frame (ORF) with two consensus pore forming domains separated by two 
putative transmembrane domains. In order to clone the complete DNA sequence encoding 
hORKl, fragments corresponding to 5' and 3' sequences were isolated from fetal brain 
Marathon Ready cDNA (Clontech) using a rapid analysis of cDNA ends (RACE) procedure 
according to manufacturers instructions. The oligos used to clone 5' and 3' fragments were 
defined by the DNA sequence encoding the ORF, allowing for a 150 bp overlap between 5' 
and 3' fragments. 

Oligos used in the RACE procedure: 
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for 5 f fragment CGC AGG CAG AGC CAC AAA GAG TAC ACA G [SEQ ID NO:41] 
for 3' fragment GGA GAT CAG CTA GGC ACC ATA TTT GG [SEQ ID NO:42] 

A 1060 bp 5' fragment was isolated which, after DNA sequence analysis, was found to 
contain a 208 bp 5 f untranslated region (UTR) and 852 bp ORF encoding 284 amino acids. 
Similarly, a 2000 bp 3' fragment was isolated which contained a 432 bp ORF capable of 
encoding an additional 144 amino acids along with an extensive 3 T UTR. A DNA fragment 
containing the complete hORKl ORF sequence was generated by PCR-mediated fusion of 
the 5 1 and 3' fragments. The isolated 5' and 3' fragments were added together to a PCR 
reaction mixture containing oligos corresponding to 14 nucleotides upstream of the ATG 
and the first 12 nucleotides of the ORF and the complement of the 20 nucleotides after the 
stop codon. 

Oligos used to clone the complete hORKl ORF 

5' ATG CTG CAT GCC TCA TGC TTC CCA GC [SEQ ID NO:43] 

3' GGT TAT TTA AAG AGA GGG CT [SEQ ID NO:44] 

The full length hORKl ORF fragment was isolated and cloned into the Invitrogen TA 
cloning kit according to manufacturers instructions. DNA sequence analysis confirmed the 
presence of a single ORF sufficient to encode a protein of 426 amino acids. The complete 
amino acid and DNA sequences are as follows: 

MLPSASRERPGYRAGVAAPDLLDPKSAAQNSKPRLSFSTKPTVLASRVESDTT 
INVMKWKTVSTIFLVVVLYLIIGATVFKALEQPHEISQRTTIVIQKQTFISQHSC 
VNSTELDELIQQIVAAINAGIIPLGNTSNQISHWDLGSSFFFAGTVITTIGFGNISP 
RTEGGKIFCIIYALLGIPLFGFLLAGVGDQLGTIFGKGIAKVEDTFIKWNVSQTK 
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IRIISTIIFILFGCVLFVALPAIIFKHIEGWSALDAIYFVVITLTTIGFGDYVAGGSD 
IEYLDFYKPVVWFWILVGLAYFAAVLSMIGRLVRVISKKTKEEVGEFRAHAA 
EWTANVTAEFKETRRRLSVEIYDKFQRATSIKRKLSAELAGNHNQELTPCRRT 
LSVNHLTSERDVLPPLLKTESIYLNGLAPHCAGEEIAVIENIK [SEQ IDNO:45] 

ccatcctaatacgactcactatagggctcgagcgnccgcccgggcagtaaaatgcctgcccgtgcagctcggagcgcgc 

agcccgtctctgaataagaagtgagtacaatggcgtgtttgtaaaaaaaagcttcaagtccgtctttttcaaaaaacatttt 

tgctgcatgcctcATGCTTCCCAGCGCCTCGCGGGAGAGACCCGGCTATAGAGCA 

GGAGTGGCGGCACCTGACTTGCTGGATCCTAAATCTGCCGCTCAGAACTC 

CAAACCGAGGCTCTCATTTTCCACGAAACCCACAGTGCTTGCTTCCCGGGT 

GGAGAGTGACACGACCATTAATGTTATGAAATGGAAGACGGTCTCCACGA 

TATTCCTGGTGGTTGTCCTCTATCTGATCATCGGAGCCACCGTGTTCAAAG 

CATTGGAGCAGCCTCATGAGATTTCACAGAGGACCACCATTGTGATCCAG 

AAGCAAACATTCATATCCCAACATTCCTGTGTCAATTCGACGGAGCTGGA 

TGAACTCATTCAGCAAATAGTGGCAGCAATAAATGCAGGGATTATACCGT 

TAGGAAACACCTCCAATCAAATCAGTCACTGGGATTTGGGAAGTTCCTTCT 

TCTTTGCTGGCACTGTTATTACAACCATAGGATTTGGAAACATCTCACCAC 

GCACAGAAGGCGGCAAAATATTCTGTATCATCTATGCCTTACTGGGAATT 

CCCCTCTTTGGTTTTCTCTTGGCTGGAGTTGGAGATCAGCTAGGCACCATA 

TTTGGAAAAGGAATTGCCAAAGTGGAAGATACGTTTATTAAGTGGAATGT 

TAGTCAGACCAAGATTCGCATCATCTCAACAATCATATTTATACTATTTGG 

CTGTGTACTCTTTGTGGCTCTGCCTGCGATCATATTCAAACACATAGAAGG 

CTGGAGTGCCCTGGACGCCATTTATTTTGTGGTTATCACTCTAACAACTAT 

TGGATTTGGTGACTACGTTGCAGGTGGATCCGATATTGAATATCTGGACTT 

CTATAAGCCTGTCGTGTGGTTCTGGATCCTTGTAGGGCTTGCTTACTTTGCT 

GCTGTCCTGAGCATGATTGGGAGATTGGTCCGAGTGATATCTAAAAAGAC 

AAAAGAAGAGGTGGGAGAGTTCAGAGCACACGCTGCTGAGTGGACAGCC 

AACGTCACAGCCGAATTCAAAGAAACCAGGAGGCGACTGAGTGTGGAGA 

TTTATGACAAGTTCCAGCGGGCCACCTCCATCAAGCGGAAGCTCTCGGCA 

GAACTGGCTGGAAACCACAATCAGGAGCTGACTCCTTGTAGGAGGACCCT 

GTCAGTGAACCACCTGACCAGCGAGAGGGATGTCTTGCCTCCCTTACTGA 

AGACTGAGAGTATCTATCTGAATGGTTTGGCGCCACACTGTGCTGGTGAA 

GAGATTGCTGTGATTGAGAACATCAAATAGccctctctttaaataaccttaggcatagccatag 

gtgaggacttctctatgctctttatgactgttgctggt^ 

cacccatcatggtcatctatcatcaagagaatttggaattctgagccagcactttctttctgatgatgcttgttgaacggcccact 

ttctttgatgagtggaatgacaagcaatgtctgatgcctttgtgtgcccagactgttttcctctctctttccctaatgtgccata^ 

gcctcagaatgaattgagaattgtttctggtaacaatgtagctttgaggga 

cctagatatggaccatttatggatgacaacaattUtltULgtaaatgacaagaaattcttatgcagccttttacctaagaaatttct 
gtcagtgccttatcttatgaagaaacagaacctctctagctaatgtgtggtttctccttccctgcccccacccctaggctcacct 
ctgcagtcttttaccccagttctcccatttgaataccataccttgntggaaacagngtgtaaaatgactgaagtgatgatgccg 
aagatgaaatagatgncaaattagntggacattga [SEQ ID NO:46] 
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The hORKl ORF was amplified using oligos that added restriction 
endonuclease cleavage sites appropriate for insertion into the yeast expression vectors 
pLPlOO and pYES2 (Invitrogen). The corresponding hORKl expression plasmids, pLPl 55 
and pLPl 56, were constructed using standard molecular biological methodology and used to 
transform S. cerevisiae CY162 cells using the lithium acetate method. The resulting yeast 
strains were examined for their ability to grow on standard synthetic agar media containing 
a low concentration of KCL Expression of hORKl in CY162 cells supports their growth on 
low (2-3 mM KC1) potassium media. Growth was observed to be more extensive when 
hORKl was expressed under control of the ADH1 promoter (pLP155) than with the 
GAL1/10 promoter (pLP156). The growth of hORKl -containing CY162 cells was 

inhibited by the known potassium channel blockers Ba 2+ , Ca 2+ , Cs + , and quinine, but not 
by TEA. The oligos used for the cloning of 5 f and 3' RACE fragments were used in this 
analysis as well. 

Oligos used to clone the hORKl ORF into pLPlOO: 

5' AAA AGA TCT AAA ATG CTT CCC AGC GCC [SEQ ID NO:47] 

3' AAA GTC GAC CTA TTT GAT GTT CTC AAT [SEQ ID NO:48] 

Oligos used to clone the hORKl ORF into pYES2: 

5' AAA AAG CTT AAA ATG CTT CCC AGC GCC [SEQ ID NO:49] 

3' AAA TCT AGA CTA TTT GAT GTT CTC AAT [SEQ ID NO:50] 
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Northern blotting analysis of hORKl expression in human tissues 
indicates that a 3.5 kb mRNA is expressed predominately in brain. The hORKl 
transcript was not detected in heart, placenta, lung, liver, kidney or pancreas. Analysis 
of blots confining RNA from separate regions of the brain was examined and further 
localized high levels of hORKl expression in the caudate nucleus, amygdala, putamen, 
frontal lobe, hippocampus, and spinal cord. The hORKl transcript is present at 
significantly lower levels in other regions of the brain; cerebellum, cerebral cortex, 
medulla, occipital lobe, temporal lobe, corpus callosum, substantia nigra, subthalamic 
nucleus, and thalamus. 



EXAMPLE 16 



2P channels obtained by searching the EST database. 

The GENBANK expressed sequence tag database (dbEST) was searched for 
putative 2P channel coding sequences using the program TBLASTN to compare all 
open reading frames to the amino acid sequence of hORKl. Several sequences 
corresponding to TW1K were identified. In addition, one human and five murine 
cDNA sequences different than TWIK were identified. The five cDNAs were 
purchased (ATCC, Genome Systems Inc.) and subjected to automated DNA sequence 
analysis. 
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A predicted open reading frame found in partial human cDNA sequence 

(GENBANK accession # n39619) apparently encodes a portion of a unique putative 

2P channel. DNA sequence analysis of the purchased cDNA clone (2771 13, SEQ ID 

NO:51) revealed the presence of a single long open reading frame: 

AACAAAAACCTTTTTTGTTTTGAATGGCCTAGAGAGGGTAAGGGATCCCCT 

GACGAACAGGAGCAGAGCCAGCTAGAACCTGGGCCTGGCCAGTTCAAGG 

CCACCAGAGGGCAGCCTTCTGCGGAAGGCAGTATTGGGGTAGGCAGGGA 

CCCCAGCAGACATGGCACTCAGAGCTCTCACTGTCCACTGACTCTCTCTTC 

TCCAGGTTATGGCCACATGGCCCCACTATCGCCAGGCGGAAAGGCCTTCT 

GCATGGTCTTANTAGCCCTTGGGCTGCCAGCCTCCTTAGCTCTCGTGGCCA 

CCCTGCGCCATTGCCTGCTGCCTGTGCTCAGCCGCCCACGTGCCTGGGTAG 

CGGTCCACTGGCAGCTGTCACCGGCCAGGGCTGCGCTGCTGCAGGCAGTT 

GCACTGGGACTGCTGGTGGCCAGCAGCTTTGTGCTGCTGCCAGCGCTGGT 

GCTGTGGGGCCTTCAGGGCGACTGCAGCCTGCTGGGGGCCGTCTACTTCT 

GCTTCAGCTCGCTCAGCACCATTGGCCTGGGG 



The predicted translation product contains amino acid motifs corresponding to 
pore forming domains, transmembrane domains, and Z4XXX2X3GX4PX5 consensus 
sequences: 

asn lys asn leu phe cys phe glu trp pro arg glu gly lys gly ser pro asp gin glu glu gin 
ser gin leu glu pro gly pro gly gin phe lys ala thr arg gly gin pro ser ala glu gly ser ile 
gly val gly arg asp pro ser arg his gly thr gin ser ser his cys pro leu thr leu ser ser pro 
gly tyr gly his met ala pro leu ser pro gly gly lys ala phe cys met val leu xxx ala leu 
gly leu pro ala ser leu ala leu val ala thr leu arg his cys leu leu pro val leu ser arg pro 
arg ala trp val ala val his trp gin leu ser pro ala arg ala ala leu leu gin ala val ala leu 
gly leu leu val ala ser ser phe val leu leu pro ala leu val leu trp gly leu gin gly asp cys 
ser leu leu gly ala val tyr phe cys phe ser ser leu ser thr ile gly leu gly [SEQ ID 
NO:54] 
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NKNLFCFEWPREGKGSPDEQEQSQLEPGPGQFKATRGQPSAEGSIGVGRDPSR 
HGTQSSHCPLTLSSPGYGHMAPLSPGGKAFCMVLXALGLPASLALVATLRHC 
LLPVLSRPRAWVAVHWQLSPARAALLQAVALGLLVASSFVLLPALVLWGLQ 
GDCSLLGAVYFCFSSLSTIGLE 



Four overlapping murine cDNA sequences (w09160, w36852, w36914, 
w99136) contain a predicted open reading frame sufficient to encode a portion of a 
unique putative 2P channel. DNA sequence analysis of the purchased cDNA clones 
(303895, 421453, 334194, 421453) revealed the presence of amino acid motifs 
corresponding to pore forming domains, transmembrane domains, and 
Z4X1X2X3GX4PX5 consensus sequences: 

ATGATACGATTTAATACGACTCACTATAGGGAATTTGGCCCTCGAGGCCA 

AGAATTCGGCACGAGGAGAATGTGCGCACGTTGGCTCTCATCGTGTGCAC 

CTTCACCTACCTGCTGGTGGGCGCCGCGGTGTTCGACGCACTGGAGTCGG 

AGCCGGAGATGATCGAGCGGCAGCGGCTGGAGCTGCGGCAGCTGGAGCT 

GCGGGCGCGCTACAACCTCAGCGAGGGCGGCTACGAGGAGCTGGAGCGC 

GTCGTGCTGCGCCTCAAGCCGCACAAGGCCGGCGTGCAGTGGCGCTTCGC 

CGGCTCCTTCTACTTCGCCATCACCGTCATCACCACCATCGGCTATGGTCA 

TGCGGCGCCCAGCACGGACGGAGGCAAGGTGTTCTGCATGTTCTACGCGC 

TGCTGGGCATCCCGCTCACACTAGTCATGTTCCAGAGCCTGGGTGAACGC 

ATCAACACCTCCGTGAGGTACCTGCTGCACCGTGCCAAGAGGGGGCTGGG 

CATGCGGCACGCCGAAGTGTCCATGGCCAACATGGTGCTCATCGGTTTCG 

TGTCGTGCATCAGCACGCTGTGCATCGGCGCAGCTGCCTTCTCCTACTACG 

AGCGCTGGACTTTCTTCCAGGCCTATTACTACTGCTTCATCACCCTCACCA 

CCATCGGCTTCGGCGACTATGTGGCGCTGCAGAAGGACCAGGCGCTGCAG 

ACGCAGCCGCAGTATGTGGCTTCAGCTTCGTGTACATCCTCACGGGCTCAC 

GGTCATCGGCGCTTCCTCAACCTCGTGGTGCTGCGATTCATGACCATGAAC 

GCCGAGGACGAGAAGCGTGATGCGGAGCACCGCGCCCTGCTCACGCACA 

ACGGCCAGGCTGTCGGCCTGGGTGGCCTGAGCTGCCTGAGCGGTAGCCTG 

GGCGACGGCGTGCGTCCCCGCGACCCAGTCACATGCGCTGCGGCCGCAAG 

CTTA [SEQIDNO:52] 
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gly ile trp pro ser arg pro arg ile arg his glu glu asn val arg thr leu ala leu ile val cys 
thr phe thr tyr leu leu val gly ala ala val phe asp ala leu glu ser glu pro glu met ile glu 
arg gin arg leu glu leu arg gin leu glu leu arg ala arg tyr asn leu ser glu gly gly tyr glu 
glu leu glu arg val val leu arg leu lys pro his lys ala gly val gin trp arg phe ala gly ser 
phe tyr phe ala ile thr val ile thr thr ile gly tyr gly his ala ala pro ser thr asp gly gly lys 
val phe cys met phe cys met phe tyr ala leu leu gly ile pro leu thr leu val met phe gin 
ser leu gly glu arg ile asn thr ser val arg tyr leu leu his arg ala lys arg gly leu gly met 
arg his ala glu val ser met ala asn met val leu ile gly phe val ser cys ile ser thr leu cys 
ile gly ala ala ala phe ser tyr tyr glu arg trp thr phe phe gin ala tyr tyr tyr cys phe ile 
thr leu thr thr ile gly phe gly asp tyr val ala leu gin lys asp gin ala leu gin thr gin pro 
gin tyr val ala ser ala ser cys thr ser ser arg ala his gly his arg arg phe leu asn leu val 
val leu arg phe met thr met asn ala glu asp glu lys arg asp ala glu his arg ala leu leu thr 
his asn gly gin ala val gly leu gly gly leu ser cys leu ser gly ser leu gly asp gly val arg 
pro arg asp pro val thr cys ala ala ala ala ser leu [SEQ ID NO: 5 5] 

GIWPSRPRIRHEENVRTLALIVCTFTYLLVGAAVFDALESEPEMIERQRLELRQ 

LELRARYNLSEGGYEELERVVLRLKPHKAGVQWRFAGSFYFAITVITTIGYGH 

AAPSTDGGKVFCMFYALLGIPLTLVMFQSLGERINTSVRYLLHRAKRGLGMR 

HAEVSMANMVLIGFVSCISTLCIGAAAFSYYERWTFFQAYYYCFITLTTIGFGD 

YVALQKDQALQTQPQWASASCTSSRAHGHRRFLNLVVLRFMTMNAEDEKR 

DAEHRALLTHNGQAVGLGGLSCLSGSLGDGVRPRDPVTCAAAASL 

Tissue distribution of mRNA expression determined by northern blotting analysis 
using a probe constituting a fragment of the open reading frame indicated high level 
expression in heart tissue. 

A predicted open reading frame found in partial murine cDNA sequence 
(GENBANK accession # wl8545) apparently encodes a portion of a unique putative 
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2P channel. DNA sequence analysis of the purchased cDNA clone (333546) revealed 
the presence of a single long open reading frame: 

CTGAAACCATGGGCCCGATACCTGCTCCTGCTTATGGCCCACCTGCTGGCC 

ATGGGCCTTGGGGCTGTGGTGCTTCAGGCCCTGGAGGGCCCTCCAGCTCG 

CCACCTCCAGGCCCAGGTCCAGGCTGAACTGGCTAGCTTCCAGGCAGAGC 

ACAGGGCCTGCTTGCCACCTGAGGCCCTGGAGGAGCTGCTAGGTGCGGTC 

CTGAGAGCACAGGCCCATGGAGTTTCCAGCCTGGGCAACAGCTCANAGAC 

AAGCAACTGGGATCTGCCCTCAGCTCTGCTGTTCACTGCCAGCATCCTCAC 

CACCACCGGTTATGGCCACATGGCCCCACTCTCCTCAGGTGGAAAGGCCT 

TCTGTGTGGTCTATGCAGCCCTTGGGCTGCCAGCCTCTCTAGCACTTGTGG 

CTGCCCTGCGCCACTGCTTGCTGCCTGTGTTCAGTCGCCCAGGTGACTGGG 

TAGCCATTCGCTGGCAGCTGGCACCAGCTCAGGCTGCTCTGCTACAGGCA 

GCAGGACTGGGCCTCCTGGTGGCCTGTGTCTTCATGCTGCTGCCAGCACTG 

GTGCTGTGGGGTGTACAGGGTGACTGGCAGCCTGCTANAACCATCTACTT 

CTGTTTCGGCTCACTCAGCACGATCGGCCTAGGAGACTTGCTGCCTGCCCA 

TGGACGTGGCCTGCACCCAGCCATTTACCACCTTGGGCAGTTTGCACTTCT 

TGGTTACTTGCTCCTGGGGCTCCTGGCCATGTTGTTAGCAGTAGAGACCTT 

CTCAGAGCTGCCTCAGGTCCGTGCCATGGTGAAATTCTTTGGGCCCAGTGG 

CTCTAGAACCGATGAAGATCAAGATGGCATCCTAGGCCAAGATGAGCTGG 

CTCTGAGCACTGTGCTGCCTGACGCCCCAGTCTTGGGACCAACCACCCCA 

GCCTGAGCGGGAGGCACCAAGGAGTGCTTGAAGAACATAGCANGAAGGG 

TTATGGGAATGAATATGTCATGGGATAATGTTAATTTTAAAAATTAAATGG 

GCTGCTTAGCATGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

[SEQ ID NO:53] 



The predicted translation product contains amino acid motifs corresponding to pore 
forming domains, transmembrane domains, and Z4X1X2X3GX4PX5 consensus 
sequences: 

leu lys pro trp ala arg tyr leu leu leu leu met ala his leu leu ala met gly leu gly ala val 
val leu gin ala leu glu gly pro pro ala arg his leu gin ala gin val gin ala glu leu ala ser 
phe gin ala glu his arg ala cys leu pro pro glu ala leu glu glu leu leu gly ala val leu arg 
ala gin ala his gly val ser ser leu gly asn ser ser xxx thr ser asn trp asp leu pro ser ala 
leu leu phe thr ala ser ile leu thr thr thr gly tyr gly his met ala pro leu ser ser gly gly 



-69- 



32,42 1-C2 



Express Mail No. TB663627987US 



lys ala phe cys val val tyr ala ala leu gly leu pro ala ser leu ala leu val ala ala leu arg 
his cys leu leu pro val phe ser arg pro gly asp tip val ala ile arg trp gin leu ala pro ala 
gin ala ala leu leu gin ala ala gly leu gly leu leu val ala cys val phe met leu leu pro ala 
leu val leu trp gly val gin gly asp trp gin pro ala xxx thr ile tyr phe cys phe gly ser leu 
ser thr ile gly leu gly asp leu leu pro ala his gly arg gly leu his pro ala ile tyr his leu 
gly gin phe ala leu leu gly tyr leu leu leu gly leu leu ala met leu leu ala val glu thr phe 
ser glu leu pro gin val arg ala met val lys phe phe gly pro ser gly ser arg thr asp glu 
asp gin asp gly ile leu gly gin asp glu leu ala leu ser thr val leu pro asp ala pro val leu 
gly pro thr thr pro ala [SEQ ID NO:56] 

LKPWARYLLLLMAHLLAMGLGAVVLQALEGPPARHLQAQVQAELASFQAE 
HRACLPPEALEELLGAVLRAQAHGVSSLGNSSXTSNWDLPSALLFTASILTTT 
G YGHMAPL S S GGKAFC V V YAALGLP ASL ALVAALRHCLLP VF SRPGD WV AI 
RWQLAPAQAALLQAAGLGLLVACVFMLLPALVLWGVQGDWQPAXTIYFCF 
GSLSTIGLGDLLPAHGRGLHPAIYHLGQFALLGYLLLGLLAMLLAVETFSELP 
QVRAMVKFFGPSGSRTDEDQDGILGQDELALSTVLPDAPVLGPTTPA 
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(C) CLASSIFICATION: 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 190.. 2043 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ACGCGATCGC CGCGAGTGTA TATTTTTTTT TTAGCTCAGT CTTCAGTGTT TCGCGATTCT 6 0 

CTTTAAAAGA AAAAAAAAAT AATAAGTCAA AACTACAAAC CACACAGCGA AAGGCGAAAG 12 0 

CAACGGTTCC TGCGAGTGTT TATTTTTTTT TTCAACAATT TTTGATCGTA GTGCGACAAT 18 0 



CCGTCGAGC ATG TCG CCG AAT CGA TGG ATC CTG CTG CTC ATC TTC TAC 
Met Ser Pro Asn Arg Trp lie Leu Leu Leu lie Phe Tyr 
15 10 



228 



ATA TCC TAC CTG ATG TTC GGG GCG GCA ATC TAT TAG CAT ATT GAG CAC 276 
lie Ser Tyr Leu Met Phe Gly Ala Ala lie Tyr Tyr His lie Glu His 
15 20 25 

GGC GAG GAG AAG ATA TCG CGC GCC GAA CAG CGC AAG GCG CAA ATT GCA 324 
Gly Glu Glu Lys He Ser Arg Ala Glu Gin Arg Lys Ala Gin He Ala 
30 35 40 45 

ATC AAC GAA TAT CTG CTG GAG GAG CTG GGC GAC AAG AAT ACG ACC ACA 3 72 

He Asn Glu Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr 
50 55 60 

CAG GAT GAG ATT CTT CAA CGG ATC TCG GAT TAC TGT GAC AAA CCG GTT 420 
Gin Asp Glu He Leu Gin Arg He Ser Asp Tyr Cys Asp Lys Pro Val 
65 70 75 

ACA TTG CCG CCG ACA TAT GAT GAT ACG CCC TAC ACG TGG ACC TTC TAC 468 
Thr Leu Pro Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr 
80 85 90 

CAT GCC TTC TTC TTC GCC TTC ACC GTT TGC TCC ACG GTG GGA TAT GGG 516 
His Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly 
95 100 105 

AAT ATA TCG CCA ACC ACC TTC GCC GGA CGG ATG ATC ATG ATC GCG TAT 564 
Asn He Ser Pro Thr Thr Phe Ala Gly Arg Met He Met He Ala Tyr 
HO 115 120 125 

TCG GTG ATT GGC ATC CCC GTC AAT GGT ATC CTC TTT GCC GGC CTC GGC 612 
Ser Val He Gly He Pro Val Asn Gly He Leu Phe Ala Gly Leu Gly 
130 135 140 

GAA TAC TTT GGA CGT ACG TTT GAA GCG ATC TAC AGA CGC TAC AAA AAG 660 
Glu Tyr Phe Gly Arg Thr Phe Glu Ala He Tyr Arg Arg Tyr Lys Lys 
145 150 155 

TAC AAG ATG TCC ACG GAT ATG CAC TAT GTC CCG CCG CAG CTG GGA TTG 708 
Tyr Lys Met Ser Thr Asp Met His Tyr Val Pro Pro Gin Leu Gly Leu 
160 165 170 

ATC ACC ACG GTG GTG ATT GCC CTG ATT CCG GGA ATA GCT CTC TTC CTG 756 
He Thr Thr Val Val He Ala Leu He Pro Gly He Ala Leu Phe Leu 
175 180 185 

GTG CTG CCC TGC GTG GGT GTT CAC CTA CTT CGA GAA CTG GGC CTA TCT 8 04 

Val Leu Pro Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser 
190 195 200 205 

TCC ATC TCG CTG TAC TAC AGC TAT GTG ACC ACC ACA ACA ATT GGA TTC 8 52 

Ser He Ser Leu Tyr Tyr Ser Tyr Val Thr Thr Thr Thr He Gly Phe 
210 215 220 

GGT GAC TAT GTG CCC ACA TTT GGA GCC AAC CAG CCC AAG GAG TTC GGC 90 0 

Gly Asp Tyr Val Pro Thr Phe Gly Ala Asn Gin Pro Lys Glu Phe Gly 
225 230 235 

GGC TGG TTC GTG GTC TAT CAG ATC TTT GTG ATC GTG TGG TTC ATC TTC 948 
Gly Trp Phe Val Val Tyr Gin He Phe Val He Val Trp Phe He Phe 
240 245 250 

TCG CTG GGA TAT CTT GTG ATG ATC ATG ACA TTT ATC ACT CGG GGC CTC 996 
Ser Leu Gly Tyr Leu Val Met He Met Thr Phe He Thr Arg Gly Leu 
255 260 265 

CAG AGC AAG AAG CTG GCA TAC CTG GAG CAG CAG TTG TCC TCC AAC CTG 1044 
Gin Ser Lys Lys Leu Ala Tyr Leu Glu Gin Gin Leu Ser Ser Asn Leu 
270 275 280 285 

AAG GCC ACA CAG AAT CGC ATC TGG TCT GGC GTC ACC AAG GAT GTG GGC 1092 



Lys Ala Thr Gin Asn Arg lie Trp Ser Gly Val Thr Lys Asp Val Gly 
290 295 300 



TAC CTC CGG CGA ATG CTC AAC GAG CTG TAC ATC CTC AAA GTG AAG CCT 114 0 

Tyr Leu Arg Arg Met Leu Asn Glu Leu Tyr lie Leu Lys Val Lys Pro 
305 310 315 

GTG TAC ACC GAT GTA GAT ATC GCC TAC ACA CTG CCA CGT TCC AAT TCG 1188 
Val Tyr Thr Asp Val Asp lie Ala Tyr Thr Leu Pro Arg Ser Asn Ser 
320 325 330 

TGT CCG GAT CTG AGC ATG TAC CGC GTG GAG CCG GCT CCC ATT CCC AGC 1236 
Cys Pro Asp Leu Ser Met Tyr Arg Val Glu Pro Ala Pro lie Pro Ser 
335 340 345 

CGG AAG AGG GCA TTC TCC GTG TGC GCC GAC ATG GTT GGC GCC CAA AGG 12 84 

Arg Lys Arg Ala Phe Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg 
350 355 360 365 

GAG GCG GGC ATG GTA CAC GCC AAT TCC GAT ACG GAT CTA ACC AAA CTG 1332 
Glu Ala Gly Met Val His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu 
370 375 380 

GAT CGC GAG AAG ACA TTC GAG ACG GCG GAG GCG TAC CAC CAG ACC ACC 1380 
Asp Arg Glu Lys Thr Phe Glu Thr Ala Glu Ala Tyr His Gin Thr Thr 
385 390 395 

GAT TTG CTG GCC AAG GTG GTC AAC GCA CTG GCC ACG GTG AAG CCA CCG 1428 
Asp Leu Leu Ala Lys Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro 
400 405 410 

CCG GCG GAA CAG GAA GAT GCG GCT CTC TAT GGT GGC TAT CAT GGC TTC 14 76 

Pro Ala Glu Gin Glu Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe 
415 420 425 

TCC GAC TCC CAG ATC CTG GCC AGC GAA TGG TCG TTC TCG ACG GTC AAC 1524 
Ser Asp Ser Gin lie Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn 
430 435 440 445 

GAG TTC ACA TCA CCG CGA CGT CCA AGA GCA CGT GCC TGC TCC GAT TTC 15 72 

Glu Phe Thr Ser Pro Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe 
450 455 460 

AAT CTG GAG GCA CCT CGC TGG CAG AGC GAG AGG CCA CTG CGT TCG AGC 162 0 

Asn Leu Glu Ala Pro Arg Trp Gin Ser Glu Arg Pro Leu Arg Ser Ser 
465 470 475 

CAC AAC GAA TGG ACA TGG AGC GGC GAC AAC CAG CAG ATC CAG GAG GCA 1668 
His Asn Glu Trp Thr Trp Ser Gly Asp Asn Gin Gin lie Gin Glu Ala 
480 485 490 

TTC AAC CAG CGC TAC AAG GGA CAG CAG CGT GCC AAC GGA GCA GCC AAC 1716 
Phe Asn Gin Arg Tyr Lys Gly Gin Gin Arg Ala Asn Gly Ala Ala Asn 
495 500 505 

TCG ACC ATG GTC CAT CTG GAG CCG GAT GCT TTG GAG GAG CAG CTG AGA 1764 
Ser Thr Met Val His Leu Glu Pro Asp Ala Leu Glu Glu Gin Leu Arg 
510 515 520 525 

AAC AAT CAC CGG GTG CCG GTC GCG TCA AGA AGT TCT CCA TGC CGG ATG 1812 
Asn Asn His Arg Val Pro Val Ala Ser Arg Ser Ser Pro Cys Arg Met 
530 535 540 

GTC TGC GAC GTC TGT TTC CCT TCC AGA AGA AGC ACC CCT CGC AGG ATC 186 0 

Val Cys Asp Val Cys Phe Pro Ser Arg Arg Ser Thr Pro Arg Arg lie 
545 550 555 

TGG AGC GCA AGT TGT CCG TGG TCT CGG TAC CCG AGG GTG TCA TCT CGC 19 08 

Trp Ser Ala Ser Cys Pro Trp Ser Arg Tyr Pro Arg Val Ser Ser Arg 



560 



565 



570 



AGG AAG CCA GAT CCC CGC TGG ACT ACT ACA TCA ACA CGG TCA CGG CGG 1956 
Arg Lys Pro Asp Pro Arg Trp Thr Thr Thr Ser Thr Arg Ser Arg Arg 
575 580 585 

CCT CCA GTC AAT CCT ATT TGC GCA ACG GAC GCG GTC CGC CAC CGC CCT 2 004 

Pro Pro Val Asn Pro lie Cys Ala Thr Asp Ala Val Arg His Arg Pro 
590 595 600 605 

TCG AAT CGA ATG GCA GCT TGG CCA GCG GCG GCG GCG GGC TAACGAACAT 2 053 

Ser Asn Arg Met Ala Ala Trp Pro Ala Ala Ala Ala Gly 
610 615 

GGGCTTCCAG ATGGAGGATG GAGCAACCCC GCCATCGGCA TTGGGCGGTG GAGC CTATCA 2113 

ACGCAAGGCG GCTGCTGGCA AGCGCCGACG CGAGAGCATC TACACCCAGA ATCAAGCCCC 2173 

ATCCGCTCGC CGGGGCAGCA TGTATCCGCC GACCGCGCAC GCCTTGGCCC AGATGCAGAT 223 3 

GCGACGCGGC AGCTTGGCAA CCAGTGGCTC TGGATCGGCG GCCATGGCGG CAGTGGCCGC 22 93 

GCGTCGTGGC AGCCTCTTCC CAGCTACAGC ATCGGCATCA TCGCTGACCT CTGCTCCGCG 23 53 

CCGAAGCAGC ATATTCTCGG TTACCTCCGA AAAGGATATG AATGTGCTGG AGCAGACGAC 2413 

CATTGCGGAT CTGATTCGTG CGCTCGAG 2441 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ser Pro Asn Arg Trp lie Leu Leu Leu lie Phe Tyr lie Ser Tyr 
15 10 15 

Leu Met Phe Gly Ala Ala lie Tyr Tyr His lie Glu His Gly Glu Glu 
20 25 30 

Lys lie Ser Arg Ala Glu Gin Arg Lys Ala Gin He Ala He Asn Glu 
35 40 45 

Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr Gin Asp Glu 
50 55 60 

He Leu Gin Arg He Ser Asp Tyr Cys Asp Lys Pro Val Thr Leu Pro 
65 70 75 80 

Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr His Ala Phe 
85 90 95 

Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn He Ser 
100 105 110 

Pro Thr Thr Phe Ala Gly Arg Met He Met He Ala Tyr Ser Val He 
115 120 125 

Gly lie Pro Val Asn Gly He Leu Phe Ala Gly Leu Gly Glu Tyr Phe 
130 135 140 

Gly Arg Thr Phe Glu Ala He Tyr Arg Arg Tyr Lys Lys Tyr Lys Met 
145 150 155 160 



Ser Thr Asp Met His Tyr Val Pro Pro Gin Leu Gly Leu lie Thr Thr 
165 170 175 

Val Val lie Ala Leu lie Pro Gly lie Ala Leu Phe Leu Val Leu Pro 
180 185 190 

Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser Ser lie Ser 
195 200 205 

Leu Tyr Tyr Ser Tyr Val Thr Thr Thr Thr lie Gly Phe Gly Asp Tyr 
210 215 220 

Val Pro Thr Phe Gly Ala Asn Gin Pro Lys Glu Phe Gly Gly Trp Phe 
225 230 235 240 

Val Val Tyr Gin He Phe Val He Val Trp Phe He Phe Ser Leu Gly 
245 250 255 

Tyr Leu Val Met He Met Thr Phe He Thr Arg Gly Leu Gin Ser Lys 
260 265 270 

Lys Leu Ala Tyr Leu Glu Gin Gin Leu Ser Ser Asn Leu Lys Ala Thr 
275 280 285 

Gin Asn Arg He Trp Ser Gly Val Thr Lys Asp Val Gly Tyr Leu Arg 
290 295 300 

Arg Met Leu Asn Glu Leu Tyr He Leu Lys Val Lys Pro Val Tyr Thr 
305 310 315 320 

Asp Val Asp He Ala Tyr Thr Leu Pro Arg Ser Asn Ser Cys Pro Asp 
325 330 335 

Leu Ser Met Tyr Arg Val Glu Pro Ala Pro He Pro Ser Arg Lys Arg 
340 345 350 

Ala Phe Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg Glu Ala Gly 
355 360 365 

Met Val His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu Asp Arg Glu 
370 375 380 

Lys Thr Phe Glu Thr Ala Glu Ala Tyr His Gin Thr Thr Asp Leu Leu 
385 390 395 400 

Ala Lys Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro Pro Ala Glu 
405 410 415 

Gin Glu Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe Ser Asp Ser 
420 425 430 

Gin He Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn Glu Phe Thr 
435 440 445 

Ser Pro Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe Asn Leu Glu 
450 455 460 

Ala Pro Arg Trp Gin Ser Glu Arg Pro Leu Arg Ser Ser His Asn Glu 
465 470 475 480 

Trp Thr Trp Ser Gly Asp Asn Gin Gin He Gin Glu Ala Phe Asn Gin 
485 490 495 

Arg Tyr Lys Gly Gin Gin Arg Ala Asn Gly Ala Ala Asn Ser Thr Met 
500 505 510 

Val His Leu Glu Pro Asp Ala Leu Glu Glu Gin Leu Arg Asn Asn His 
515 520 525 



Arg Val Pro Val Ala Ser Arg Ser Ser Pro Cys Arg Met Val Cys Asp 
530 535 540 

Val Cys Phe Pro Ser Arg Arg Ser Thr Pro Arg Arg lie Trp Ser Ala 
545 550 555 560 

Ser Cys Pro Trp Ser Arg Tyr Pro Arg Val Ser Ser Arg Arg Lys Pro 
565 570 575 

Asp Pro Arg Trp Thr Thr Thr Ser Thr Arg Ser Arg Arg Pro Pro Val 
580 585 590 

Asn Pro lie Cys Ala Thr Asp Ala Val Arg His Arg Pro Ser Asn Arg 
595 600 605 

Met Ala Ala Trp Pro Ala Ala Ala Ala Gly 
610 615 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1008 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG TCC GAT CAG CTG TTT GTC GCA TTT GAG AAG TAT TTC TTG ACG AGT 48 
Met Ser Asp Gin Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser 
15 10 15 

AAC GAG GTC AAG AAG AAT GCA GCA ACG GAG ACA TGG ACA TTT TCA TCG 96 
Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser 
20 25 30 

TCC ATT TTC TTT GCC GTA ACC GTC GTC ACT ACC ATC GGA TAC GGT AAT 144 
Ser lie Phe Phe Ala Val Thr Val Val Thr Thr lie Gly Tyr Gly Asn 
35 40 45 

CCA GTT CCA GTG ACA AAC ATT GGA CGG ATA TGG TGT ATA TTG TTC TCC 192 
Pro Val Pro Val Thr Asn lie Gly Arg lie Trp Cys lie Leu Phe Ser 
50 55 60 

TTG CTT GGA ATA CCT CTA ACA CTG GTT ACC ATC GCT GAC TTG GCA GGT 240 
Leu Leu Gly lie Pro Leu Thr Leu Val Thr lie Ala Asp Leu Ala Gly 
65 70 75 80 

AAA TTC CTA TCT GAA CAT CTT GTT TGG TTG TAT GGA AAC TAT TTG AAA 2 88 

Lys Phe Leu Ser Glu His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys 
85 90 95 

TTA AAA TAT CTC ATA TTG TCA CGA CAT CGA AAA GAA CGG AGA GAG CAC 3 36 

Leu Lys Tyr Leu lie Leu Ser Arg His Arg Lys Glu Arg Arg Glu His 
100 105 110 

GTT TGT GAG CAC TGT CAC AGT CAT GGA ATG GGG CAT GAT ATG AAT ATC 3 84 

Val Cys Glu His Cys His Ser His Gly Met Gly His Asp Met Asn lie 
115 120 125 



GAG GAG AAA AGA ATT CCT GCA TTC CTG GTA TTA GCT ATT CTG ATA GTA 
Glu Glu Lys Arg lie Pro Ala Phe Leu Val Leu Ala lie Leu lie Val 
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130 135 140 

TAT ACA GCG TTT GGC GGT GTC CTA ATG TCA AAA TTA GAG CCG TGG TCT 480 
Tyr Thr Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser 
145 150 155 160 

TTC TTC ACT TCA TTC TAC TGG TCC TTC ATT ACA ATG ACT ACT GTC GGG 52 8 

Phe Phe Thr Ser Phe Tyr Trp Ser Phe lie Thr Met Thr Thr Val Gly 
165 170 175 

TTT GGC GAC TTG ATG CCC AGA AGG GAC GGA TAC ATG TAT ATC ATA TTG 576 
Phe Gly Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr lie lie Leu 
180 185 190 

CTC TAT ATC ATT TTA GGT AAA TTT TCA ATG AAA AAA AAA CAA AAA TTC 624 
Leu Tyr lie lie Leu Gly Lys Phe Ser Met Lys Lys Lys Gin Lys Phe 
195 200 205 

AAA ATA TTT TTA GGT CTT GCA ATA ACT ACA ATG TGC ATT GAT TTG GTA 672 
Lys lie Phe Leu Gly Leu Ala lie Thr Thr Met Cys lie Asp Leu Val 
210 215 220 

GGA GTA CAG TAT ATT CGA AAG ATT CAT TAT TTC GGA AGA AAA ATT CAA 72 0 

Gly Val Gin Tyr lie Arg Lys lie His Tyr Phe Gly Arg Lys lie Gin 
225 230 235 240 

GAC GCT AGA TCT GCA TTG GCG GTT GTA GGA GGA AAG GTA GTC CTT GTA 768 
Asp Ala Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val 
245 250 255 

TCA GAA CTC TAC GCA AAT TTA ATG CAA AAG CGA GCT CGT AAC ATG TCC 816 
Ser Glu Leu Tyr Ala Asn Leu Met Gin Lys Arg Ala Arg Asn Met Ser 
260 265 270 

CGA GAA GCT TTT ATA GTG GAG AAT CTC TAT GTT TCC AAA CAC ATC ATA 864 
Arg Glu Ala Phe lie Val Glu Asn Leu Tyr Val Ser Lys His lie lie 
275 280 285 

CCA TTC ATA CCA ACT GAT ATC CGA TGT ATT CGA TAT ATT GAT CAA ACT 912 
Pro Phe lie Pro Thr Asp lie Arg Cys lie Arg Tyr lie Asp Gin Thr 
290 295 300 

GCC GAT GCT GCT ACC ATT TCC ACG TCA TCG TCT GCA ATT GAT ATG CAA 96 0 

Ala Asp Ala Ala Thr lie Ser Thr Ser Ser Ser Ala lie Asp Met Gin 
305 310 315 320 

AGT TGT AGA TTT TGT CAT TCA AGA TAT TCT CTC AAT CGT GCA TTC AAA 10 08 

Ser Cys Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys 
325 330 335 

TAG 1011 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ser Asp Gin Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser 
15 10 15 



Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser 
20 25 30 



Ser He Phe Phe Ala Val Thr Val Val Thr Thr He Gly Tyr Gly Asn 
35 40 45 

Pro Val Pro Val Thr Asn He Gly Arg He Trp Cys He Leu Phe Ser 
50 55 60 

Leu Leu Gly He Pro Leu Thr Leu Val Thr He Ala Asp Leu Ala Gly 
65 70 75 80 

Lys Phe Leu Ser Glu His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys 
85 90 95 

Leu Lys Tyr Leu He Leu Ser Arg His Arg Lys Glu Arg Arg Glu His 
100 105 HO 

Val Cys Glu His Cys His Ser His Gly Met Gly His Asp Met Asn He 
115 120 125 

Glu Glu Lys Arg He Pro Ala Phe Leu Val Leu Ala He Leu He Val 
130 135 140 

Tyr Thr Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser 
145 150 155 160 

Phe Phe Thr Ser Phe Tyr Trp Ser Phe He Thr Met Thr Thr Val Gly 
165 170 175 

Phe Gly Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr He He Leu 
180 185 190 

Leu Tyr He He Leu Gly Lys Phe Ser Met Lys Lys Lys Gin Lys Phe 
195 200 205 

Lys He Phe Leu Gly Leu Ala He Thr Thr Met Cys He Asp Leu Val 
210 215 220 

Gly Val Gin Tyr He Arg Lys He His Tyr Phe Gly Arg Lys He Gin 
225 230 235 240 

Asp Ala Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val 
245 250 255 

Ser Glu Leu Tyr Ala Asn Leu Met Gin Lys Arg Ala Arg Asn Met Ser 
260 265 270 

Arg Glu Ala Phe He Val Glu Asn Leu Tyr Val Ser Lys His He He 
275 280 285 

Pro Phe He Pro Thr Asp He Arg Cys He Arg Tyr He Asp Gin Thr 
290 295 300 

Ala Asp Ala Ala Thr He Ser Thr Ser Ser Ser Ala He Asp Met Gin 
305 310 315 320 

Ser Cys Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys 
325 330 335 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TCCATTTTCT TTGCCGTAAC CGTCGTCACT ACCATCGGAT ACGGTAATCC A 
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(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
TCATTCTACT GGTCCTTCAT TACAATGACT ACTGTCGGGT TTGGCGACTT G 51 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 7 : 

Ala Phe Leu Phe Ser lie Glu Thr Gin Thr Thr lie Gly Tyr Gly Phe 
15 10 15 

Arg Cys Val Thr Asp Glu Cys Pro 
20 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ala Phe Leu Phe Ser Leu Glu Thr Gin Val Thr lie Gly Tyr Gly Phe 
15 10 15 

Arg Cys Val Thr Glu Gin Cys Ala 
20 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



Ala Phe Leu Phe Phe He Glu Thr Glu Ala Thr He Gly Tyr Gly Tyr 
15 10 15 



Arg Tyr He Thr Asp His Cys Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn 
15 10 15 

He Ser Pro Thr Thr Phe Ala Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Phe Trp Trp Ala Val Val Thr Met Thr Thr Val Gly Tyr Gly Asp 
15 10 15 

Met Thr Pro Val Gly Phe Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Ala Phe Trp Tyr Thr He Val Thr Met Thr Thr Leu Gly Tyr Gly Asp 
15 10 15 

Met Val Pro Glu Thr He Ala Gly 
20 

(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Ala Phe Trp Trp Ala Gly lie Thr Met Thr Thr Val Gly Tyr Gly Asp 
15 10 15 

lie Cys Pro Thr Thr Ala Leu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Gly Leu Trp Trp Ala Leu Val Thr Met Thr Thr Val Gly Tyr Gly Asp 
15 10 15 

Met Ala Pro Lys Thr Tyr lie Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Leu Tyr Phe Thr Met Thr Cys Met Thr Ser Val Gly Phe Gly Asn 
15 10 15 

Val Ala Ala Glu Thr Asp Asn Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Cys Val Tyr Phe Leu lie Val Thr Met Ser Thr Val Gly Tyr Gly Asp 
15 10 15 

Val Tyr Cys Glu Thr Val Leu Gly 
20 



INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Ser Leu Tyr Thr Ser Tyr Val Thr Thr Thr Thr lie Gly Phe Gly Asp 
15 10 15 

Tyr Val Pro Thr Phe Gly Ala Asn 
20 

INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn 
15 10 15 

lie Ser Pro Thr Thr Phe Ala Gly 
20 

INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ser lie Phe Phe Ala Val Thr Val Val Thr Thr lie Gly Tyr Gly Asn 
15 10 15 

Pro Val Pro Val Thr Asn Thr Gly 
20 

) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Ser Leu Tyr Thr Ser Tyr Val Thr Thr Thr Thr lie Gly Phe Gly Asp 
15 10 15 



Tyr Val Pro Thr Phe Gly Ala Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser Phe Tyr Trp Ser Phe lie Thr Met Thr Thr Val Gly Phe Gly Asp 
15 10 15 

Leu Met Pro Arg Arg Asp Gly Tyr 
20 

(2) INFORMATION FOR SEQ ID NO: 22: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(i) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
ATAAAGCTTA AAAATGTCGC CGAATCGATG GAT 3 3 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGCTCTAGAC CTCCATCTGG AAGCCCATGT 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
AAAAAGCTTA AAATGGCACA CATCACG 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
AAACTCGAGT CATACCTGTG GACT 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
AAAAAGCTTA AAATGGTCGG GCAATTG 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
AAAAGCATGC TCATCTGGAT GGGCA 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
AAAAAGCTTA AAATGGCCTC GGTCGCC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
TTTTCTAGAC TACATCGTTG TCTT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAAAAGCTTA AAATGAATCT GATCAAC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
AAATCTAGAT TAGTCGAAAC TGAA 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
AAAAAGCTTA AAATGCCTGG CGGA 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

AAATCTAGAG GCTACAGGAA GTCC 

(2) INFORMATION FOR SEQ ID NO:34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 7 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGGGGTACCA AAATGTCGGG GTGTGAT 27 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TTTTTCTAGA TCAAGAGTTA TCATC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



ATGGTAATAA 


TCAACCGATC 


GAACACCTAT 


GCCGTTGAGC 


AGGAAGCATT 


TCCAAGAGAC 


60 


AAGTACAATA 


TTGTCTACTG 


GCTCGTCATT 


CTTGTTGGAT 


TCGGAGTTCT 


TCTGCCATGG 


120 


AATATGTTCA 


TTACTATCGC 


CCCTGAGTAT 


TATGTGAATT 


ATTGGTTCAA 


ACCGGATGGC 


180 


GTGGAGACAT 


GGTATTCGAA AGAATTCATG 


GGATCTTTGA 


CGATTGGCTC 


ACAACTTCCA 


240 


AACGCAAGCA 


TTAATGTTTT 


CAACCTGTTC 


CTCATTATTG 


CTGGTCCCCT 


GATCTACCGC 


300 


GTCTTTGCTC 


CGGTTTGCTT 


CAACATCGTC 


AACCTGACAA 


TCATTCTCAT 


CCTCGTCATT 


360 


GTTCTGGAGC 


CCACTGAAGA 


TTCCATGTCC 


TGGTTTTTCT 


GGGTAACTCT 


TGGAATGGCG 


420 


ACTTCAATCA ATTTTAGCAA 


TGGGCTATAT 


GAAAACTCGG 


TTTATGGAGT 


TGGTGGCGAT 


480 


TTTCCGCACA 


CCTACATTGG 


CGCTCTCTTG 


ATTGGAAACA ACATTTGCGG ATTGCTGATA 


540 


ACGGTTGTGA 


AAATCGGAGT 


GACCTATTTT 


CTGAATGATG 


AGCCTAAACT 


TGTTGCAATC 


600 


GTCTATTTCG 


GCATATCGTT 


GGTGATCCTT 


CTGGTGTGTG 


CAATTGCACT 


TTTCTTTATC 


660 


ACAAAGCAAG 


ATTTCTACCA 


CTATCACCAT 


CAAAAAGGAA 


TGGAAATTCG 


CGAAAAGGCG 


720 


GAAACCGACA 


GACCGTCTCC 


ATCCATTCTT 


TGGACCACAT 


TCACAAACTG 


TTATGGGCAA 


780 


CTCTTCAATG 


TTTGGTTCTG 


CTTTGCCGTT 


ACTCTCACAA 


TCTTCCCTGT 


TATGATGACC 


840 


GTTACCACTC 


GTGGAGATTC 


CGGCTTCCTA 


AACAAAATTA 


TGTCTGAAAA 


CGATGAAATC 


900 



TACACTTTGC 


TCACAAGTTT 


CCTCGTCTTC 


AATTTGTTCG 


CTGCGATTGG 


ATCCATAGTT 


960 


GCTTCCAAGA 


TTCACTGGCC 


GACACCCCGT 


TACCTCAAAT 


TTGCCATAAT 


CTTGCGTGCT 


1020 


CTTTTCATTC 


CATTCTTCTT 


CTTCTGCAAC 


TATCGTGTCC 


AGACGCGTGC 


TTATCCTGTT 


1080 


TTCTTTGAGT 


CTACTGACAT 


TTTTGTGATT 


GGTGGAATTG 


CCATGTCTTT 


TTCACATGGA 


1140 


TACCTCAGCG 


CTCTGGCAAT 


GGGATACACT 


CCAAACGTCG 


TGCCATCTCA 


CTACTCAAGA 


1200 


TTTGCCGCTC 


AGCTTTCCGT 


TTGCACTCTT 


ATGGTTGGCC 


TTCTCACCGG 


TGGCCTGTGG 


1260 


CCCGTTGTTA 


TTGAGCACTT 


CGTGGACAAG 


CCAAGTATCT 


TATAAATATT 


TATAGCATTA 


1320 


GAGTATACTT 


GTTATATGTT 


GTTTTTATTA 


AGCTGTGGAA 


TAAAATAATT 


ATTAAAAAAA 


1380 


AAAAAAAAAA 


AAAA 










1394 


(2) INFORMATION FOR SEQ ID NO: 37 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Ser Pro Asn Arg Trp lie Leu Leu Leu lie Phe Tyr lie Ser Tyr 
15 10 15 

Leu Met Phe Gly Ala Ala lie Tyr Tyr His lie Glu His Gly Glu Glu 
20 25 30 

Lys lie Ser Arg Ala Glu Gin Arg Lys Ala Gin lie Ala lie Asn Glu 
35 40 45 

Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr Gin Asp Glu 
50 55 60 

lie Leu Gin Arg lie Ser Asp Tyr Cys Asp Lys Pro Val Thr Leu Pro 
65 70 75 80 

Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr His Ala Phe 
85 90 95 

Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn lie Ser 
100 105 110 

Pro Thr Thr Phe Ala Gly Arg Met lie Met lie Ala Tyr Ser Val lie 
115 120 125 

Gly lie Pro Val Asn Gly lie Leu Phe Ala Gly Leu Gly Glu Tyr Phe 
130 135 140 

Gly Arg Thr Phe Glu Ala lie Tyr Arg Arg Tyr Lys Lys Tyr Lys Met 
145 150 155 160 

Ser Thr Asp Met His Tyr Val Pro Pro Gin Leu Gly Leu lie Thr Thr 
165 170 175 

Val Val lie Ala Leu lie Pro Gly lie Ala Leu Phe Leu Val Leu Pro 
180 185 190 

Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser Ser lie Ser 



195 200 205 

Leu Tyr Tyr Ser Tyr Val Thr lie Thr Thr lie Gly Phe Gly Asp Tyr 
210 215 220 

Val Pro Thr Phe Gly Ala Asn Gin Pro Lys Glu Phe Gly Gly Trp Phe 
225 230 235 240 

Val Val Tyr Gin lie Phe Val lie Val Trp Phe He Phe Ser Leu Gly 
245 250 255 

Tyr Leu Val Met He Met Thr Phe He Thr Arg Gly Leu Gin Ser Lys 
260 265 270 

Lys Leu Ala Tyr Leu Glu Gin Gin Leu Ser Ser Asn Leu Lys Ala Thr 
275 280 285 

Gin Asn Arg He Trp Ser Gly Val Thr Lys Asp Val Gly Tyr Leu Arg 
290 295 300 

Arg Met Leu Asn Glu Leu Tyr He Leu Lys Val Lys Pro Val Tyr Thr 
305 310 315 320 

Asp Val Asp He Ala Tyr Thr Leu Pro Arg Ser Asn Ser Pro Leu Ser 
325 330 335 

Met Tyr Arg Val Glu Pro Ala Pro He Pro Ser Arg Lys Arg Ala Phe 
340 345 350 

Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg Glu Ala Gly Met Val 
355 360 365 

His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu Asp Arg Glu Lys Thr 
370 375 380 

Phe Glu Thr Ala Glu Ala Tyr His Gin Thr Thr Asp Leu Leu Ala Lys 
385 390 395 400 

Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro Pro Ala Leu Gin Glu 
405 410 415 

Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe Ser Asp Ser Gin He 
420 425 430 

Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn Glu Phe Thr Ser Pro 
435 440 445 

Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe Asn Leu Glu Ala Pro 
450 455 460 

Arg Trp Gin Ser Glu Arg Pro Leu Arg Ser Ser His Asn Glu Trp 
465 470 475 

INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



Met Ser Asp Gin Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser 
15 10 15 



Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser 
20 25 30 

Ser lie Phe Phe Ala Val Thr Val Val Thr Thr lie Gly Tyr Gly Asn 
35 40 45 

Pro Val Pro Val Thr Asn lie Gly Arg lie Trp lie Leu Phe Ser Leu 
50 55 60 

lie Gly lie Pro Leu Thr Leu Val Thr lie Ala Leu Ala Gly Lys Phe 
65 70 75 80 

Leu Ser Glu His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys Leu Lys 
85 90 95 

Tyr Leu lie Leu Ser Arg His Arg Lys Glu Arg Arg Glu His Val Cys 
100 105 110 

Glu His Cys His Ser His Gly Met Gly His Asp Met Asn lie Glu Glu 
115 120 125 

Lys Arg He Pro Ala Phe Leu Val Leu Ala He Leu He Val Tyr Thr 
130 135 140 

Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser Phe Phe 
145 150 155 160 

Thr Ser Phe Tyr Trp Ser Phe He Thr Met Thr Thr Val Gly Phe Gly 
165 170 175 

Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr He He Leu Leu Tyr 
180 185 190 

He He Leu Gly Lys Phe Ser Met Lys Lys Lys Gin Lys Phe Lys He 
195 200 205 

Phe Leu Gly Leu Ala He Thr Thr Met Cys He Asp Leu Val Gly Val 
210 215 220 

Gin Tyr He Arg Lys He His Tyr Phe Gly Arg Lys He Gin Asp Ala 
225 230 235 240 

Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val Ser Glu 
245 250 255 

Leu Tyr Ala Asn Leu Met Gin Lys Arg Ala Arg Asn Met Ser Arg Glu 
260 265 270 

Ala Phe He Val Glu Asn Leu Tyr Val Ser Lys His He He Pro Phe 
275 280 285 

He Pro Thr Asp He Arg Cys He Arg Tyr He Asp Gin Thr Ala Asp 
290 295 300 

Ala Ala Thr He Ser Thr Ser Ser Ser Ala He Asp Met Gin Ser Cys 
305 310 315 320 

Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys Xaa 
325 330 335 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
■(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 39: 
TNGGATATCT GGATGACTAT T 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
AGTCATCCAG ATAACTCCAG TACTAGTGT 
(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CGCAGGCAGA GCCACAAAGA GTACACAG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
GGAGATCAGC TAGGCACCAT ATTTGG 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
ATGCTGCATG CCTCATGCTT CCCAGC 



(2) INFORMATION FOR SEQ ID NO: 44: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GGTTATTTAA AGAGAGGGCT 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Leu Pro Ser Ala Ser Arg Glu Arg Pro Gly Tyr Arg Ala Gly Val 
15 10 15 

Ala Ala Pro Asp Leu Leu Asp Pro Lys Ser Ala Ala Gin Asn Ser Lys 
20 25 30 

Pro Arg Leu Ser Phe Ser Thr Lys Pro Thr Val Leu Ala Ser Arg Val 
35 40 45 

Glu Ser Asp Thr Thr lie Asn Val Met Lys Trp Lys Thr Val Ser Thr 
50 55 60 

lie Phe Leu Val Val Val Leu Tyr Leu lie He Gly Ala Thr Val Phe 
65 70 75 80 

Lys Ala Leu Glu Gin Pro His Glu He Ser Gin Arg Thr Thr He Val 
85 90 95 

He Gin Lys Gin Thr Phe He Ser Gin His Ser Cys Val Asn Ser Thr 
100 105 110 

Glu Leu Asp Glu Leu He Gin Gin He Val Ala Ala He Asn Ala Gly 
115 120 125 

He He Pro Leu Gly Asn Thr Ser Asn Gin He Ser His Trp Asp Leu 
130 135 140 

Gly Ser Ser Phe Phe Phe Ala Gly Thr Val He Thr Thr He Gly Phe 
145 150 155 160 

Gly Asn He Ser Pro Arg Thr Glu Gly Gly Lys He Phe Cys He He 
165 170 175 

Tyr Ala Leu Leu Gly He Pro Leu Phe Gly Phe Leu Leu Ala Gly Val 
180 185 190 

Gly Asp Gin Leu Gly Thr He Phe Gly Lys Gly He Ala Lys Val Glu 
195 200 205 



Asp Thr Phe lie Lys Trp Asn Val Ser Gin Thr Lys lie Arg lie lie 
210 215 220 

Ser Thr lie lie Phe lie Leu Phe Gly Cys Val Leu Phe Val Ala Leu 
225 230 235 240 

Pro Ala lie lie Phe Lys His lie Glu Gly Trp Ser Ala Leu Asp Ala 
245 250 255 

lie Tyr Phe Val Val lie Thr Leu Thr Thr lie Gly Phe Gly Asp Tyr 
260 265 270 

Val Ala Gly Gly Ser Asp lie Glu Tyr Leu Asp Phe Tyr Lys Pro Val 
275 280 285 

Val Trp Phe Trp lie Leu Val Gly Leu Ala Tyr Phe Ala Ala Val Leu 
290 295 300 

Ser Met lie Gly Arg Leu Val Arg Val lie Ser Lys Lys Thr Lys Glu 
305 310 315 320 

Glu Val Gly Glu Phe Arg Ala His Ala Ala Glu Trp Thr Ala Asn Val 
325 330 335 

Thr Ala Glu Phe Lys Glu Thr Arg Arg Arg Leu Ser Val Glu lie Tyr 
340 345 350 

Asp Lys Phe Gin Arg Ala Thr Ser lie Lys Arg Lys Leu Ser Ala Glu 
355 360 365 

Leu Ala Gly Asn His Asn Gin Glu Leu Thr Pro Cys Arg Arg Thr Leu 
370 375 380 

Ser Val Asn His Leu Thr Ser Glu Arg Asp Val Leu Pro Pro Leu Leu 
385 390 395 400 

Lys Thr Glu Ser lie Tyr Leu Asn Gly Leu Ala Pro His Cys Ala Gly 
405 410 415 

Glu Glu lie Ala Val He Glu Asn He Lys 
420 425 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2130 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

CCATCCTAAT ACGACTCACT ATAGGGCTCG AGCGNCCGCC CGGGCAGTAA AATGCCTGCC 6 0 

CGTGCAGCTC GGAGCGCGCA GCCCGTCTCT GAATAAGAAG TGAGTACAAT GGCGTGTTTG 12 0 

TAAAAAAAAG CTTCAAGTCC GTCTTTTTCA AAAAACATTT TGAATGCTGC ATGCCTCATG 18 0 

CTTCCCAGCG CCTCGCGGGA GAGACC CGGC TATAGAGCAG GAGTGGCGGC ACCTGACTTG 240 

CTGGATCCTA AATCTGCCGC TCAGAACTCC AAAC CGAGGC TCTCATTTTC CACGAAACCC 30 0 

ACAGTGCTTG CTTCCCGGGT GGAGAGTGAC ACGACCATTA ATGTTATGAA ATGGAAGACG 36 0 

GTCTCCACGA TATTCCTGGT GGTTGTCCTC TATCTGATCA TCGGAGCCAC CGTGTTCAAA 42 0 



GCATTGGAGC 


AGCCTCATGA 


GATTTCACAG 


AGGACCACCA 


TTGTGATCCA 


GAAGCAAACA 


480 


TTCATATCCC 


AACATTCCTG 


TGTCAATTCG 


ACGGAGCTGG 


ATGAACTCAT 


TCAGCAAATA 


540 


GTGGCAGCAA 


TAAATGCAGG 


GATTATACCG 


TTAGGAAACA 


CCTCCAATCA 


AATCAGTCAC 


600 


TGGGATTTGG 


GAAGTTCCTT 


CTTCTTTGCT 


GGCACTGTTA 


TTACAACCAT 


AGGATTTGGA 


660 


AACATCTCAC 


CACGCACAGA 


AGGCGGCAAA 


ATATTCTGTA 


TCATCTATGC 


CTTACTGGGA 


720 


ATTCCCCTCT 


TTGGTTTTCT 


CTTGGCTGGA 


GTTGGAGATC 


AGCTAGGCAC 


CATATTTGGA 


780 


AAAGGAATTG 


CCAAAGTGGA 


AGATACGTTT 


ATTAAGTGGA 


ATGTTAGTCA 


GACCAAGATT 


840 


CGCATCATCT 


CAACAATCAT 


ATTTATACTA 


TTTGGCTGTG 


TACTCTTTGT 


GGCTCTGCCT 


900 


GCGATCATAT 


TCAAACACAT 


AGAAGGCTGG 


AGTGCCCTGG 


ACGCCATTTA 


TTTTGTGGTT 


960 


ATCACTCTAA 


CAACTATTGG 


ATTTGGTGAC 


TACGTTGCAG 


GTGGATCCGA 


TATTGAATAT 


1020 


CTGGACTTCT 


ATAAGCCTGT 


CGTGTGGTTC 


TGGATCCTTG 


TAGGGCTTGC 


TTACTTTGCT 


1080 


GCTGTCCTGA 


GCATGATTGG 


GAGATTGGTC 


CGAGTGATAT 


CTAAAAAGAC 


AAAAGAAGAG 


1140 


GTGGGAGAGT 


TCAGAGCACA 


CGCTGCTGAG 


TGGACAGCCA ACGTCACAGC 


CGAATTCAAA 


1200 


GAAACCAGGA 


GGCGACTGAG 


TGTGGAGATT 


TATGACAAGT 


TCCAGCGGGC 


CACCTCCATC 


1260 


AAGCGGAAGC 


TCTCGGCAGA 


ACTGGCTGGA 


AACCACAATC 


AGGAGCTGAC 


TCCTTGTAGG 


1320 


AGGACCCTGT 


CAGTGAACCA 


CCTGACCAGC 


GAGAGGGATG 


TCTTGCCTCC 


CTTACTGAAG 


1380 


ACTGAGAGTA 


TCTATCTGAA 


TGGTTTGGCG 


CCACACTGTG 


CTGGTGAAGA 


GATTGCTGTG 


1440 


ATTGAGAACA 


TCAAATAGCC 


CTCTCTTTAA 


ATAACCTTAG 


GCATAGCCAT 


AGGTGAGGAC 


1500 


TTCTCTATGC 


TCTTTATGAC 


TGTTGCTGGT 


AGCATTTTTT 


AAATTGTGCA 


TGAGCTCAAA 


1560 


GGGGGAACAA 


AATAGATACA 


CCCATCATGG 


TCATCTATCA 


TCAAGAGAAT 


TTGGAATTCT 


1620 


GAGCCAGCAC 


TTTCTTTCTG 


ATGATGCTTG 


TTGAACGGCC 


CACTTTCTTT 


GATGAGTGGA 


1680 


ATGACAAGCA 


ATGTCTGATG 


CCTTTGTGTG 


CCCAGACTGT 


TTTCCTCTCT 


CTTTCCCTAA 


1740 


TGTGCCATAA 


GGCCTCAGAA 


TGAATTGAGA 


ATTGTTTCTG 


GTAACAATGT 


AGCTTTGAGG 


1800 


GATCAGTTCT 


TAACTTTTCA 


GGGTCTACCT 


AACTGAGCCT 


AGATATGGAC 


CATTTATGGA 


1860 


TGACAACAAT 


TTTTTTTTTG 


TAAATGACAA 


GAAATTCTTA 


TGCAGCCTTT 


TACCTAAGAA 


1920 


ATTTCTGTCA 


GTGCCTTATC 


TTATGAAGAA 


ACAGAACCTC 


TCTAGCTAAT 


GTGTGGTTTC 


1980 


TCCTTCCCTG 


CCCCCACCCC 


TAGGCTCACC 


TCTGCAGTCT 


TTTACCCCAG 


TTCTCCCATT 


2040 


TGAAT AC CAT 


ACCTTGNTGG 


AAACAGNGTG 


TAAAATGACT 


GAAGTGATGA 


TGCCGAAGAT 


2100 


GAAATAGATG 


NCAAATTAGN 


TGGACATTGA 






2130 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
AAAAGATCTA AAATGCTTCC CAGCGCC 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
AAAGTCGACC TATTTGATGT TCTCAAT 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 
AAAAAGCTTA AAATGCTTCC CAGCGCC 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AAATCTAGAC TATTTGATGT TCTCAAT 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
AACAAAAACC TTTTTTGTTT TGAATGGCCT AGAGAGGGTA AGGGATCCCC TGACGAACAG 
GAGCAGAGCC AGCTAGAACC TGGGCCTGGC CAGTTCAAGG CCACCAGAGG GCAGCCTTCT 



GCGGAAGGCA GTATTGGGGT AGGCAGGGAC CCCAGCAGAC ATGGCACTCA GAGCTCTCAC 180 

TGTCCACTGA CTCTCTCTTC TCCAGGTTAT GGCCACATGG CCCCACTATC GCCAGGCGGA 240 

AAGGCCTTCT GCATGGTCTT ATAGCCCTTG GGCTGCCAGC CTCCTTAGCT CTCGTGGCCA 3 00 

CCCTGCGCCA TTGCCTGCTG CCTGTGCTCA GCCGCCCACG TGCCTGGGTA GCGGTCCACT 360 

GGCAGCTGTC ACCGGCCAGG GCTGCGCTGC TGCAGGCAGT TGCACTGGGA CTGCTGGTGG 42 0 

CCAGCAGCTT TGTGCTGCTG CCAGCGCTGG TGCTGTGGGG CCTTCAGGGC GACTGCAGCC 480 

TGCTGGGGGC CGTCTACTTC TGCTTCAGCT CGCTCAGCAC CATTGGCCTG GGG 533 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



ATGATACGAT 


TTAATACGAC 


TCACTATAGG 


GAATTTGGCC 


CTCGAGGCCA 


AGAATTCGGC 


60 


ACGAGGAGAA 


TGTGCGCACG 


TTGGCTCTCA 


TCGTGTGCAC 


CTTCACCTAC 


CTGCTGGTGG 


120 


GCGCCGCGGT 


GTTCGACGCA 


CTGGAGTCGG 


AGC CGGAGAT 


GATCGAGCGG 


CAGCGGCTGG 


180 


AGCTGCGGCA 


GCTGGAGCTG 


CGGGCGCGCT 


ACAACCTCAG 


CGAGGGCGGC 


TACGAGGAGC 


240 


TGGAGCGCGT 


CGTGCTGCGC 


CTCAAGCCGC 


ACAAGGCCGG 


CGTGCAGTGG 


CGCTTCGCCG 


300 


GCTCCTTCTA 


CTTCGCCATC 


ACCGTCATCA 


CCACCATCGG 


CTATGGTCAT 


GCGGCGCCCA 


360 


GCACGGACGG 


AGGCAAGGTG 


TTCTGCATGT 


TCTACGCGCT 


GCTGGGCATC 


CCGCTCACAC 


420 


TAGTCATGTT 


CCAGAGCCTG 


GGTGAACGCA 


TCAACACCTC 


CGTGAGGTAC 


CTGCTGCACC 


480 


GTGCCAAGAG 


GGGGCTGGGC 


ATGCGGCACG 


CCGAAGTGTC 


CATGGCCAAC 


ATGGTGCTCA 


540 


TCGGTTTCGT 


GTCGTGCATC 


AGCACGCTGT 


GCATCGGCGC 


AGCTGCCTTC 


TCCTACTACG 


600 


AGCGCTGGAC 


TTTCTTCCAG 


GCCTATTACT 


ACTGCTTCAT 


CACCCTCACC 


ACCATCGGCT 


660 


TCGGCGACTA 


TGTGGCGCTG 


C AG AAGG AC C 


AGGCGCTGCA 


GACGCAGCCG 


CAGTATGTGG 


720 


CTTCAGCTTC 


GTGTACATCC 


TCACGGGCTC 


ACGGTCATCG 


GCGCTTCCTC 


AACCTCGTGG 


780 


TGCTGCGATT 


CATGACCATG 


AACGCCGAGG 


ACGAGAAGCG 


TGATGCGGAG 


CACCGCGCCC 


840 


TGCTCACGCA 


CAACGGCCAG 


GCTGTCGGCC 


TGGGTGGCCT 


GAGCTGCCTG 


AGCGGTAGCC 


900 


TGGGCGACGG 


CGTGCGTCCC 


CGCGACCCAG 


TCACATGCGC 


TGCGGCCGCA 


AGCTTA 


956 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

CTGAAACCAT GGGCCCGATA CCTGCTCCTG CTTATGGCCC ACCTGCTGGC CATGGGCCTT 60 

GGGGCTGTGG TGCTTCAGGC CCTGGAGGGC CCTCCAGCTC GCCACCTCCA GGCCCAGGTC 12 0 

CAGGCTGAAC TGGCTAGCTT CCAGGCAGAG CACAGGGCCT GCTTGCCACC TGAGGCCCTG 180 

GAGGAGCTGC TAGGTGCGGT CCTGAGAGCA CAGGCCCATG GAGTTTCCAG CCTGGGCAAC 240 

AGCTCAAGAC AAGCAACTGG GATCTGCCCT CAGCTCTGCT GTTCACTGCC AGCATCCTCA 3 00 

CCACCACCGG TTATGGCCAC ATGGCCCCAC TCTCCTCAGG TGGAAAGGCC TTCTGTGTGG 360 

TCTATGCAGC CCTTGGGCTG CCAGCCTCTC TAGCACTTGT GGCTGCCCTG CGCCACTGCT 42 0 

TGCTGCCTGT GTTCAGTCGC CCAGGTGACT GGGTAGCCAT TCGCTGGCAG CTGGCACCAG 48 0 

CTCAGGCTGC TCTGCTACAG GCAGCAGGAC TGGGCCTCCT GGTGGCCTGT GTCTTCATGC 54 0 

TGCTGCCAGC ACTGGTGCTG TGGGGTGTAC AGGGTGACTG GCAGCCTGCT AAACCATCTA 600 

CTTCTGTTTC GGCTCACTCA GCACGATCGG CCTAGGAGAC TTGCTGCCTG CCCATGGACG 66 0 

TGGCCTGCAC CCAGCCATTT ACCACCTTGG GCAGTTTGCA CTTCTTGGTT ACTTGCTCCT 72 0 

GGGGCTCCTG GCCATGTTGT TAGCAGTAGA GACCTTCTCA GAGCTGCCTC AGGTCCGTGC 78 0 

CATGGTGAAA TTCTTTGGGC CCAGTGGCTC TAGAACCGAT GAAGATCAAG ATGGCATCCT 84 0 

AGGC CAAGAT GAGCTGGCTC TGAGCACTGT GCTGCCTGAC GCCCCAGTCT TGGGACCAAC 9 00 

CACCCCAGCC TGAGCGGGAG GCACCAAGGA GTGCTTGAAG AACATAGCAG AAGGGTTATG 960 

GGAATGAATA TGTCATGGGA TAATGTTAAT TTTAAAAATT AAATGGGCTG CTTAGCATGC 102 0 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 1052 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Asn Lys Asn Leu Phe Cys Phe Glu Trp Pro Arg Glu Gly Lys Gly Ser 
15 10 15 

Pro Asp Gin Glu Glu Gin Ser Gin Leu Glu Pro Gly Pro Gly Gin Phe 
20 25 30 

Lys Ala Thr Arg Gly Gin Pro Ser Ala Glu Gly Ser lie Gly Val Gly 
35 40 45 

Arg Asp Pro Ser Arg His Gly Thr Gin Ser Ser His Cys Pro Leu Thr 
50 55 60 

Leu Ser Ser Pro Gly Tyr Gly His Met Ala Pro Leu Ser Pro Gly Gly 
65 70 75 80 



Lys Ala Phe Cys Met 
85 

Ala Leu Val Ala Thr 
100 

Pro Arg Ala Trp Val 
115 

Ala Leu Leu Gin Ala 
130 

Val Leu Leu Pro Ala 
145 

Leu Leu Gly Ala Val 
165 

Leu Gly 



Val Leu Xaa Ala Leu Gly 
90 

Leu Arg His Cys Leu Leu 
105 

Ala Val His Trp Gin Leu 
120 

Val Ala Leu Gly Leu Leu 
135 

Leu Val Leu Trp Gly Leu 
150 155 

Tyr Phe Cys Phe Ser Ser 
170 



Leu Pro Ala Ser Leu 
95 

Pro Val Leu Ser Arg 
110 

Ser Pro Ala Arg Ala 
125 

Val Ala Ser Ser Phe 
140 

Gin Gly Asp Cys Ser 
160 

Leu Ser Thr lie Gly 
175 



INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly lie Trp Pro Ser Arg Pro Arg lie Arg His Glu Glu Asn Val Arg 
15 10 15 

Thr Leu Ala Leu lie Val Cys Thr Phe Thr Tyr Leu Leu Val Gly Ala 
20 25 30 

Ala Val Phe Asp Ala Leu Glu Ser Glu Pro Glu Met lie Glu Arg Gin 
35 40 45 

Arg Leu Glu Leu Arg Gin Leu Glu Leu Arg Ala Arg Tyr Asn Leu Ser 
50 55 60 

Glu Gly Gly Tyr Glu Glu Leu Glu Arg Val Val Leu Arg Leu Lys Pro 
65 70 75 80 

His Lys Ala Gly Val Gin Trp Arg Phe Ala Gly Ser Phe Tyr Phe Ala 
85 90 95 

lie Thr Val He Thr Thr He Gly Tyr Gly His Ala Ala Pro Ser Thr 
100 105 110 

Asp Gly Gly Lys Val Phe Cys Met Phe Cys Met Phe Tyr Ala Leu Leu 
115 120 125 

Gly He Pro Leu Thr Leu Val Met Phe Gin Ser Leu Gly Glu Arg He 
130 135 140 

Asn Thr Ser Val Arg Tyr Leu Leu His Arg Ala Lys Arg Gly Leu Gly 
145 150 155 160 

Met Arg His Ala Glu Val Ser Met Ala Asn Met Val Leu He Gly Phe 
165 170 175 

Val Ser Cys He Ser Thr Leu Cys He Gly Ala Ala Ala Phe Ser Tyr 



180 185 190 

Tyr Glu Arg Trp Thr Phe Phe Gin Ala Tyr Tyr Tyr Cys Phe lie Thr 
195 200 205 

Leu Thr Thr lie Gly Phe Gly Asp Tyr Val Ala Leu Gin Lys Asp Gin 
210 215 220 

Ala Leu Gin Thr Gin Pro Gin Tyr Val Ala Ser Ala Ser Cys Thr Ser 
225 230 235 240 

Ser Arg Ala His Gly His Arg Arg Phe Leu Asn Leu Val Val Leu Arg 
245 250 255 

Phe Met Thr Met Asn Ala Glu Asp Glu Lys Arg Asp Ala Glu His Arg 
260 265 270 

Ala Leu Leu Thr His Asn Gly Gin Ala Val Gly Leu Gly Gly Leu Ser 
275 280 285 

Cys Leu Ser Gly Ser Leu Gly Asp Gly Val Arg Pro Arg Asp Pro Val 
290 295 300 

Thr Cys Ala Ala Ala Ala Ser Leu 
305 310 

(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 04 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Lys Pro Trp Ala Arg Tyr Leu Leu Leu Leu Met Ala His Leu Leu 
15 10 15 

Ala Met Gly Leu Gly Ala Val Val Leu Gin Ala Leu Glu Gly Pro Pro 
20 25 30 

Ala Arg His Leu Gin Ala Gin Val Gin Ala Glu Leu Ala Ser Phe Gin 
35 40 45 

Ala Glu His Arg Ala Cys Leu Pro Pro Glu Ala Leu Glu Glu Leu Leu 
50 55 60 

Gly Ala Val Leu Arg Ala Gin Ala His Gly Val Ser Ser Leu Gly Asn 
65 70 75 80 

Ser Ser Xaa Thr Ser Asn Trp Asp Leu Pro Ser Ala Leu Leu Phe Thr 
85 90 95 

Ala Ser lie Leu Thr Thr Thr Gly Tyr Gly His Met Ala Pro Leu Ser 
100 105 110 

Ser Gly Gly Lys Ala Phe Cys Val Val Tyr Ala Ala Leu Gly Leu Pro 
115 120 125 

Ala Ser Leu Ala Leu Val Ala Ala Leu Arg His Cys Leu Leu Pro Val 
130 135 140 

Phe Ser Arg Pro Gly Asp Trp Val Ala lie Arg Trp Gin Leu Ala Pro 
145 150 155 160 



Ala Gin Ala Ala 



Cys Val Phe Met 
180 

Asp Trp Gin Pro 
195 

Thr lie Gly Leu 
210 

Pro Ala lie Tyr 
225 

Leu Gly Leu Leu 



Pro Gin Val Arg 
260 

Thr Asp Glu Asp 
275 

Ser Thr Val Leu 
290 



Leu Leu Gin Ala 
165 

Leu Leu Pro Ala 



Ala Xaa Thr lie 
200 

Gly Asp Leu Leu 
215 

His Leu Gly Gin 
230 

Ala Met Leu Leu 
245 

Ala Met Val Lys 



Gin Asp Gly lie 
280 

Pro Asp Ala Pro 
295 



Ala Gly Leu Gly 
170 

Leu Val Leu Trp 
185 

Tyr Phe Cys Phe 



Pro Ala His Gly 
220 

Phe Ala Leu Leu 
235 

Ala Val Glu Thr 
250 

Phe Phe Gly Pro 
265 

Leu Gly Gin Asp 



Val Leu Gly Pro 
300 



Leu Leu Val Ala 
175 

Gly Val Gin Gly 
190 

Gly Ser Leu Ser 
205 

Arg Gly Leu His 



Gly Tyr Leu Leu 
240 

Phe Ser Glu Leu 
255 

Ser Gly Ser Arg 
270 

Glu Leu Ala Leu 
285 

Thr Thr Pro Ala 



What is claimed is: 

1 . A potassium channel comprising four hydrophobic domains capable of 
forming transmembrane helices, wherein 

(i) a first pore-forming domain is interposed between a first and a 
second transmembrane helix; and 

(ii) a second pore-forming domain is interposed between a third and a 
fourth transmembrane helix. 

2. The potassium channel of Claim 1 wherein each pore-forming domain 
comprises a potassium selective peptide motif selected from the group consisting 
of dipeptide motifs and tripeptide motifs. 

3. The potassium channel of Claim 2 wherein the peptide motif comprises 
GXG wherein X is selected from the group of amino acids V ? L, Y, F ? M, or I. 

4. The potassium channel of Claim 3 wherein the pore-forming domain 
comprises ZXXL^Z^GXG wherein 

(i) Z through Z 2 are amino acid residues comprising T or S; 

(ii) Z 3 is an amino acid residue comprising I or V; and 

(iii) X is an amino acid residue comprising V, L, Y, F, M ? or I. 

5. The potassium channel of Claim 4 where X is L or I. 

6. The potassium channel of Claims 1, 2 ? 3, 4 ? or 5 wherein at least one 
pore-forming domain is positioned proximal to an exterior portion of a cell 
membrane. 
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7. The potassium channel of Claim 5 further comprising an amino acid 
motif ZX 1 X 2 X 3 GX 4 PX 5 downstream of said first pore-forming domain. 

8. The potassium channel of Claim 7 wherein ZXJX2X3GX4PX5 is 
positioned about 12-25 amino acids downstream of said first pore-forming 
domain. 

9. The potassium channel of Claim 8 wherein ZX 1 X 2 X 3 GX4PX5 is 
positioned within the second transmembrane domain. 

10. The potassium channel of Claim 8 or 9 wherein ZX 1 X 2 X 3 GX 4 PX 5 is 
positioned beginning about 16 amino acids downstream of said first pore-forming 
domain. 

1 1 . The potassium channel of Claim 8, 9 or 10 wherein a second 
ZX 1 X 2 X 3 GX4PX5 peptide is located within said second pore-forming region. 

12. The potassium channel of Claim 8 ? 9, or 10 wherein ZX X X 2 X 3 comprises 
the amino acids YALL. 

13. The potassium channels of Claim 12 wherein ZX 1 X 2 X 3 GX 4 P comprises 
the amino acids YALLGIP. 

14. The potassium channel of Claim 4 further comprising a glycosylation 
site. 
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15. The potassium channel of Claim 14 wherein said glycosylation site is 
asparagine-linked. 

16. The potassium channel of Claims 1, 2, 3, 4, 5, 7, or 8 characterized in 
that it is derived from invertebrates. 

17. The potassium channel of Claim 16 characterized in that it is insect- 
derived. 

18. The potassium channel of Claim 16 characterized in that it is nematode- 
derived. 

19. The potassium channel of Claims 1, 2, 3, 4 5 5, 6, 7, or 8 characterized in 
that it is derived from vertebrates. 

20. The potassium channel of Claim 19 characterized in that it is mammalian 
derived. 

21 . The potassium channel of Claim 20 characterized in that it is human 
derived. 

22. An isolated nucleotide sequence capable of encoding a protein designated 
CORK. 

23 . An isolated nucleotide sequence capable of encoding a protein designated 
hORK. 
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24. An isolated nucleotide sequence comprising 

(i) a nucleotide sequence depicted in SEQ ID NO 1 or 36; 

(ii) a nucleotide sequence that hybridizes to said sequence depicted in 
SEQ IDNO:l or 36; 

(iii) a nucleotide sequence that is degenerate to the nucleotide sequence 
depicted in SEQ ID NO:l or 36; and 

(iv) a functional derivative of the nucleotide sequence depicted in SEQ 
IDNO:l or 36. 

25. An isolated nucleotide sequence comprising 

(i) a nucleotide sequence depicted in SEQ ID NO:46; 

(ii) a nucleotide sequence that hybridizes to said sequence depicted in 
SEQ ID NO:46; 

(iii) a nucleotide sequence that is degenerate to the nucleotide sequence 
depicted in SEQ ID NO:46; and 

(iv) a functional derivative of the nucleotide sequence depicted in SEQ 
ID NO:46. 

26. An isolated nucleotide sequence comprising 

(i) a nucleotide sequence depicted in SEQ ID NO:51, 52 or 53; 

(ii) a nucleotide sequence that hybridizes to said sequence depicted in 
SEQ ID NO:51,52 or 53; 

(iii) a nucleotide sequence that is degenerate to the nucleotide sequence 
depicted in SEQ ID NO:51, 52 or 53; and 

(iv) a functional derivative of the nucleotide sequence depicted in SEQ 
ID NO:52, 52, or 53. 
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27. An expression vector capable of expressing the potassium channel of 
Claim 16 in a cell membrane of a yeast cell. 

28. An expression vector capable of expressing the potassium channel of 
Claim 19 in a cell membrane of a yeast cell. 

29. An expression vector capable of expressing the potassium channel 
encoded by the nucleotide sequence of Claims 24, 25, or 26 in a cell membrane of 
a yeast cell 

30. A transformed yeast cell comprising the expression vector of Claims 27, 
28, or 29. 

31. A method of assaying substances to determine effects on cell growth, the 
method comprising the steps of: 

a) preparing cultures of yeast cells in a medium adequate to support 
growth of potassium-dependent mutant strains expressing the 
potassium channel of Claim 1 ; 

b) contacting said substance to a portion of said yeast cells thereafter 
permitting sufficient time for continued growth, if any, of the 
portion of yeast cells so contacted as well as the portion not 
contacted with said substance; 

c) identifying zones of growth around the substances, wherein the 
level of growth indicates whether or not activity of the 
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heterologous potassium channel has been modulated as compared 
to yeast cells not contacted with said substances. 

32. The method of Claim 3 1 wherein said yeast cells comprise the nucleotide 
sequence of Claims 24, 25, or 26. 

33. A kit comprising the nucleotide sequences of Claim 32. 

34. A method of modulating the activity of the potassium channel of Claim 19, 
positioned in a cellular membrane of a living organism by contacting said cellular 
membrane with a substance, in an amount and for a period of time sufficient to 
modify the ability of potassium ions to pass through said channel positioned in said 
cellular membrane of the living organism. 

35. A method of modulating cardiac activity, by applying to a patient in need of 
such cardiac modulation, a substance capable of interacting with a potassium channel 
contained in the cardiac cells of such patient that is biologically equivalent to the 
potassium channel encoded by SEQ ID NO:l or 46, and modulating the activity of 
same. 

36. The potassium channel of Claim 7 capable of rectifying the inward and 
outward flow of ions. 

37. The potassium channel of Claim 7 capable of rectifying the outward flow of 
ions. 
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38. The potassium channel of Claim 36 or 37 wherein direction and magnitude 
of potassium current is modulated by external potassium in concentration. 

39. The potassium channel of Claim 36 or 37 wherein potassium is the permeant 
ion. 

40. A method of chromosome mapping comprising 

(i) providing PCR primers from the nucleotide sequence of Claims 24, 25, 

or 26; 

(ii) performing a PCR assay of somatic cell hybrids containing 
chromosomes using the primers of step i); and 

(iii) detecting amplified fragments as a measure of the hybrids containing 
the gene corresponding to the primers. 

41 . A transgenic animal comprising the nucleotide sequence of Claims 24, 25, or 
26. 
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ABSTRACT 



POTASSIUM CHANNELS, NUCLEOTIDE SEQUENCES ENCODING 
THEM, AND METHODS OF USING SAME 

This invention relates generally to a new family of potassium channels. 
More particularly, the present invention relates to the cloning and characterization of 
a family of distinct trans-membrane potassium ion channels, characterization of such 
channels, newly identified polynucleotide sequences, polypeptides encoded by such 
sequences, expression vectors capable of heterologous expression of such 
polynucleotide sequences, transformed host cells containing the expression vectors, 
and assay methods and kits therefor for determining the expression of heterologous 
nucleotide sequences encoding all or a portion of said potassium channels in host 
cells, chromosome mapping, diagnostic methodologies and kits therefore. Genes 
encoding potassium channels representative of this family were cloned from 
Drosophila melanogaster, Caenorhabditis elegans, human and mouse ESTs, and 
human brain, heart and kidney cDNA libraries. More particularly, the invention arises 
in part from the determination that the DNA sequences of these genes encode a struc- 
turally distinct potassium channel whose molecular architecture is characterized by 
four membrane spanning domains and two putative pore forming domains. 
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O SC galactose. 100 mM KCI SC glucose, OmM KCI 



SC galactose, 0 mM KCI SC glucose. 100 mM KCI 



FIG. 1 
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